Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
T-tree
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
{{one source|date=June 2013}} {{Short description|Data structure in computer science}} {{For|T-branching|H tree|Iterated function system}} <!-- Image with unknown copyright status removed: [[Image:Ttreenonde.png|thumb|right|251px|An example of a T-tree node structure]] --> [[Image:T-tree-1.png|thumb|right|251px|An example T-tree]] In [[computer science]] a '''T-tree''' is a type of [[binary tree]] [[data structure]] that is used by [[main memory database|main-memory databases]], such as [[Datablitz]], [[eXtremeDB]], [[MySQL Cluster]], [[TimesTen|Oracle TimesTen]] and MobileLite. A T-tree is a [[Height-balanced tree|balanced]] index tree data structure optimized for cases where both the index and the actual data are fully kept in memory, just as a [[B-tree]] is an index structure optimized for storage on block oriented secondary storage devices like hard disks. T-trees seek to gain the performance benefits of in-memory tree structures such as [[AVL tree]]s while avoiding the large storage space overhead which is common to them. T-trees do not keep copies of the indexed data fields within the index tree nodes themselves. Instead, they take advantage of the fact that the actual data is always in main memory together with the index so that they just contain pointers to the actual data fields. The 'T' in T-tree refers to the shape of the node data structures in the original paper which first described this type of index.<ref>{{cite conference |url=https://archive.org/details/verylargedatabas0000inte |first1=Tobin J. |last1=Lehman |first2=Michael J. |last2=Carey |title=A Study of Index Structures for Main Memory Database Management Systems |location=Kyoto |date=25β28 August 1986 |conference=Twelfth International Conference on Very Large Databases (VLDB 1986) |isbn=0-934613-18-4 |url-access=registration }}</ref> ==Node structures== A T-tree node usually consists of pointers to the parent node, the left and right child node, an ordered array of data pointers and some extra control data. Nodes with two [[subtree]]s are called ''internal nodes'', nodes without [[subtree]]s are called ''leaf nodes'' and nodes with only one [[subtree]] are named ''half-leaf'' nodes. A node is called the ''bounding node'' for a value if the value is between the node's current minimum and maximum value, inclusively. [[Image:T-tree-2.png|thumb|right|251px|Bound values]] For each internal node, leaf or half leaf nodes exist that contain the predecessor of its smallest data value (called the ''greatest lower bound'') and one that contains the successor of its largest data value (called the ''least upper bound''). Leaf and half-leaf nodes can contain any number of data elements from one to the maximum size of the data array. Internal nodes keep their occupancy between predefined minimum and maximum numbers of elements ==Algorithms== ===Search=== * Search starts at the root node * If the current node is the bounding node for the search value then search its data array. Search fails if the value is not found in the data array. * If the search value is less than the minimum value of the current node then continue search in its left subtree. Search fails if there is no left subtree. * If the search value is greater than the maximum value of the current node then continue search in its right subtree. Search fails if there is no right subtree. ===Insertion=== * Search for a bounding node for the new value. If such a node exists then: ** check whether there is still space in its data array, if so then insert the new value and finish ** if no space is available then remove the minimum value from the node's data array and insert the new value. Now proceed to the node holding the greatest lower bound for the node that the new value was inserted to. If the removed minimum value still fits in there then add it as the new maximum value of the node, else create a new right subnode for this node. * If no bounding node was found then insert the value into the last node searched if it still fits into it. In this case the new value will either become the new minimum or maximum value. If the value doesn't fit anymore then create a new left or right subtree. If a new node was added then the tree might need to be rebalanced, as described below. ===Deletion=== * Search for bounding node of the value to be deleted. If no bounding node is found then finish. * If the bounding node does not contain the value then finish. * delete the value from the node's data array Now we have to distinguish by node type: * Internal node: If the node's data array now has less than the minimum number of elements then move the greatest lower bound value of this node to its data value. Proceed with one of the following two steps for the half leaf or leaf node the value was removed from. * Leaf node: If this was the only element in the data array then delete the node. Rebalance the tree if needed. * Half leaf node: If the node's data array can be merged with its leaf's data array without overflow then do so and remove the leaf node. Rebalance the tree if needed. ===Rotation and balancing=== A T-tree is implemented on top of an underlying [[self-balancing binary search tree]]. Specifically, Lehman and Carey's article describes a T-tree balanced like an [[AVL tree]]: it becomes out of balance when a node's child trees differ in height by at least two levels. This can happen after an insertion or deletion of a node. After an insertion or deletion, the tree is scanned from the leaf to the root. If an imbalance is found, one [[tree rotation]] or pair of rotations is performed, which is guaranteed to balance the whole tree. When the rotation results in an internal node having fewer than the minimum number of items, items from the node's new child(ren) are moved into the internal node. ==Performance and Storage== Although T-trees were once widely used for main-memory databases because of performance benefits, recent trends for very large main-memory databases put more emphasis on provisioning cost. With modern NOSQL database systems often storing trillions of records, the memory cost to store even a single index that includes actual values can exceed tens or even hundreds of terabytes. ==See also== * [[Tree (graph theory)]] * [[Tree (set theory)]] * [[Tree structure]] * [[Exponential tree]] ===Other trees=== * [[B-tree]] ([[2β3 tree]], [[2β3β4 tree]], [[B+ tree]], [[B*-tree]], [[UB-tree]]) * [[Dancing tree]] * [[Fusion tree]] * [[k-d tree|''k''-d tree]] * [[Octree]] * [[Quadtree]] * [[R-tree]] * [[Radix tree]] * [[Top tree]] ==References== <references /> ==External links== * [http://www.oracle.com/technology/products/timesten/htdocs/faq/technical_faq.html##6 Oracle TimesTen FAQ entry on index mentioning T-Trees] * [http://www.oracle.com/technology/products/timesten/pdf/wp/timesten_tech_wp_dec_2005.pdf Oracle Whitepaper: Oracle TimesTen Products and Technologies] * [http://www.dependability.org/wg10.4/timedepend/08-Rasto.pdf DataBlitz presentation mentioning T-Trees] * [http://code.google.com/p/ttree/ An Open-source T*-tree Library] {{CS-Trees}} {{DEFAULTSORT:T-Tree}} [[Category:Binary trees]] [[Category:Search trees]]
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)
Pages transcluded onto the current version of this page
(
help
)
:
Template:CS-Trees
(
edit
)
Template:Cite conference
(
edit
)
Template:For
(
edit
)
Template:One source
(
edit
)
Template:Short description
(
edit
)