Class Table

java.lang.Object
com.mckoi.database.Table
All Implemented Interfaces:
TableDataSource
Direct Known Subclasses:
AbstractDataTable, CompositeTable, FilterTable, JoinedTable

public abstract class Table extends Object implements TableDataSource
This is a definition for a table in the database. It stores the name of the table, and the fields (columns) in the table. A table represents either a 'core' DataTable that directly maps to the information stored in the database, or a temporary table generated on the fly.

It is an abstract class, because it does not implement the methods to add, remove or access row data in the table.

Author:
Tobias Downer
  • Field Summary

    Fields
    Modifier and Type
    Field
    Description
    protected static boolean
     
  • Constructor Summary

    Constructors
    Modifier
    Constructor
    Description
    protected
    The Constructor.
  • Method Summary

    Modifier and Type
    Method
    Description
    all(QueryContext context, Expression lhs, Operator op, Table table)
    Evaluates a non-correlated ALL type operator given the LHS expression, the RHS subquery and the ALL operator to use.
    final boolean
    allColumnMatchesValue(int column, Operator op, TObject ob)
    Returns true if the given column contains all values that the given operator returns true for with the given value.
    any(QueryContext context, Expression lhs, Operator op, Table right_table)
    Evaluates a non-correlated ANY type operator given the LHS expression, the RHS subquery and the ANY operator to use.
    final boolean
    columnContainsCell(int column, TObject cell)
    Returns true if the given cell is found in the table.
    final boolean
    columnContainsValue(int column, TObject ob)
    Returns true if the given column number contains the value given.
    final boolean
    columnMatchesValue(int column, Operator op, TObject ob)
    Returns true if the given column contains a value that the given operator returns true for with the given value.
    final Table
    Returns a Table that is a merge of this table and the destination table.
    static boolean
    Compares cell1 with cell2 and if the given operator evalutes to true then returns true, otherwise false.
    Returns a DebugLogger object that we can use to log debug messages to.
    Deprecated.
    - not a proper SQL distinct.
    final Table
    distinct(int[] col_map)
    Returns a new table that has only distinct rows in it.
    final void
    Dumps the contents of the table in a human readable form to the given output stream.
    final Table
    Returns a new Table based on this table with no rows in it.
    final Table
    Exhaustively searches through this table for rows that match the expression given.
    final int
    A faster way to find a column index given a string column name.
    abstract int
    Given a fully qualified variable field name, ie.
    abstract TObject
    getCellContents(int column, int row)
    Returns an object that represents the information in the given cell in the table.
    abstract int
    Returns the number of columns in the table.
    getColumnDefAt(int col_index)
    Returns the DataTableColumnDef object for the given column index.
    getColumnScheme(int column)
    Returns the SelectableScheme that indexes the given column in this table.
    abstract Database
    Returns the Database object that this table is derived from.
    abstract DataTableDef
    Returns a DataTableDef object that defines the name of the table and the layout of the columns of the table.
    final TObject
    getFirstCellContent(int column)
    Returns the TObject value that represents the first item in the set or null if there are no items in the column set.
    final TObject[]
    getFirstCellContent(int[] col_map)
    Returns the TObject value that represents the first item in the set or null if there are no items in the column set.
    final TObject
    getLastCellContent(int column)
    Returns the TObject value that represents the last item in the set or null if there are no items in the column set.
    final TObject[]
    getLastCellContent(int[] col_map)
    Returns the TObject value that represents the last item in the set or null if there are no items in the column set.
    abstract Variable
    getResolvedVariable(int column)
    Returns a fully qualified Variable object that represents the name of the column at the given index.
    abstract int
    Returns the number of rows stored in the table.
    final TObject
    If the given column contains all items of the same value, this method returns the value.
    final TObject[]
    getSingleCellContent(int[] col_map)
    If the given column contains all items of the same value, this method returns the value.
    Returns the TransactionSystem object that this table is part of.
    This returns an object that can only access the cells that are in this table, and has no other access to the 'Table' class's functionality.
    getTTypeForColumn(int column)
    Returns a TType object that would represent values at the given column index.
    Returns a TType object that would represent values in the given column.
    abstract boolean
    Returns true if the table has its row roots locked (via the lockRoot(int) method.
    final Table
    join(Table table)
    Performs a natural join of this table with the given table.
    abstract void
    lockRoot(int lock_key)
    Locks the root table(s) of this table so that it is impossible to overwrite the underlying rows that may appear in this table.
    orderByColumn(int col_index, boolean ascending)
    Returns a Table which is identical to this table, except it is sorted by the given column name.
     
    orderByColumn(Variable column, boolean ascending)
     
    final Table
    orderByColumns(int[] col_map)
    Returns a table that is ordered by the given column numbers.
    orderedRowList(int[] col_map)
    Returns an IntegerVector that represents the list of rows in this table in sorted order by the given column map.
    outside(Table rtable)
    Finds all rows in this table that are 'outside' the result in the given table.
    void
    printGraph(PrintStream out, int indent)
    Prints a graph of the table hierarchy to the stream.
    final Table
    rangeSelect(Variable col_var, SelectableRange[] ranges)
    A single column range select on this table.
    Returns an Enumeration of the rows in this table.
    Returns a list of rows that represents the enumerator order of this table.
    selectAll(int column)
    Returns an array that represents the sorted order of this table by the given column number.
    selectFirst(int column)
    Returns an array that represents the first sorted element(s) of the given column number.
    selectLast(int column)
    Returns an array that represents the last sorted element(s) of the given column number.
    selectRange(int column, SelectableRange[] ranges)
    Returns an array that represents the sorted order of this table of all values in the given SelectableRange objects of the given column index.
    selectRest(int column)
    Returns an array that represents the rest of the sorted element(s) of the given column number.
    final Table
    simpleJoin(QueryContext context, Table table, Variable lhs_var, Operator op, Expression rhs)
    A simple join operation.
    final Table
    simpleSelect(QueryContext context, Variable lhs_var, Operator op, Expression rhs)
    A simple select on this table.
    final Table
    singleRowSelect(int row_index)
    Selects a single row at the given index from this table.
    Assuming this table is a 2 column key/value table, and the first column is a string, this will convert it into a map.
    Returns a string that represents this table.
    final Table
    union(Table table)
    Returns a new Table that is the union of the this table and the given table.
    abstract void
    unlockRoot(int lock_key)
    Unlocks the root tables so that the underlying rows may once again be used if they are not locked and have been removed.

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
  • Field Details

    • DEBUG_QUERY

      protected static boolean DEBUG_QUERY
  • Constructor Details

    • Table

      protected Table()
      The Constructor. Requires a name and the fields in the table.
  • Method Details

    • getDatabase

      public abstract Database getDatabase()
      Returns the Database object that this table is derived from.
    • getSystem

      public final TransactionSystem getSystem()
      Returns the TransactionSystem object that this table is part of.
      Specified by:
      getSystem in interface TableDataSource
    • Debug

      public DebugLogger Debug()
      Returns a DebugLogger object that we can use to log debug messages to.
    • getColumnCount

      public abstract int getColumnCount()
      Returns the number of columns in the table.
    • getRowCount

      public abstract int getRowCount()
      Returns the number of rows stored in the table.
      Specified by:
      getRowCount in interface TableDataSource
    • getTTypeForColumn

      public TType getTTypeForColumn(int column)
      Returns a TType object that would represent values at the given column index. Throws an error if the column can't be found.
    • getTTypeForColumn

      public TType getTTypeForColumn(Variable v)
      Returns a TType object that would represent values in the given column. Throws an error if the column can't be found.
    • findFieldName

      public abstract int findFieldName(Variable v)
      Given a fully qualified variable field name, ie. 'APP.CUSTOMER.CUSTOMERID' this will return the column number the field is at. Returns -1 if the field does not exist in the table.
    • getResolvedVariable

      public abstract Variable getResolvedVariable(int column)
      Returns a fully qualified Variable object that represents the name of the column at the given index. For example, new Variable(new TableName("APP", "CUSTOMER"), "ID")
    • getCellContents

      public abstract TObject getCellContents(int column, int row)
      Returns an object that represents the information in the given cell in the table. This will generally be an expensive algorithm, so calls to it should be kept to a minimum. Note that the offset between two rows is not necessarily 1. Use 'rowEnumeration' to get the contents of a set.
      Specified by:
      getCellContents in interface TableDataSource
    • rowEnumeration

      public abstract RowEnumeration rowEnumeration()
      Returns an Enumeration of the rows in this table. Each call to 'RowEnumeration.nextRowIndex()' returns the next valid row in the table. Note that the order that rows are retreived depend on a number of factors. For a DataTable the rows are accessed in the order they are in the data file. For a VirtualTable, the rows are accessed in the order of the last select operation.

      If you want the rows to be returned by a specific column order then use the 'selectxxx' methods.

      Specified by:
      rowEnumeration in interface TableDataSource
    • getDataTableDef

      public abstract DataTableDef getDataTableDef()
      Returns a DataTableDef object that defines the name of the table and the layout of the columns of the table. Note that for tables that are joined with other tables, the table name and schema for this object become mangled. For example, a table called 'PERSON' joined with a table called 'MUSIC' becomes a table called 'PERSON#MUSIC' in a null schema.
      Specified by:
      getDataTableDef in interface TableDataSource
    • lockRoot

      public abstract void lockRoot(int lock_key)
      Locks the root table(s) of this table so that it is impossible to overwrite the underlying rows that may appear in this table. This is used when cells in the table need to be accessed 'outside' the lock. So we may have late access to cells in the table. 'lock_key' is a given key that will also unlock the root table(s). NOTE: This is nothing to do with the 'LockingMechanism' object.
    • unlockRoot

      public abstract void unlockRoot(int lock_key)
      Unlocks the root tables so that the underlying rows may once again be used if they are not locked and have been removed. This should be called some time after the rows have been locked.
    • hasRootsLocked

      public abstract boolean hasRootsLocked()
      Returns true if the table has its row roots locked (via the lockRoot(int) method.
    • getColumnScheme

      public SelectableScheme getColumnScheme(int column)
      Returns the SelectableScheme that indexes the given column in this table.
      Specified by:
      getColumnScheme in interface TableDataSource
    • getColumnDefAt

      public DataTableColumnDef getColumnDefAt(int col_index)
      Returns the DataTableColumnDef object for the given column index.
    • dumpTo

      public final void dumpTo(PrintStream out) throws IOException
      Dumps the contents of the table in a human readable form to the given output stream. This should only be used for debuging the database.
      Throws:
      IOException
    • emptySelect

      public final Table emptySelect()
      Returns a new Table based on this table with no rows in it.
    • singleRowSelect

      public final Table singleRowSelect(int row_index)
      Selects a single row at the given index from this table.
    • columnMerge

      public final Table columnMerge(Table table)
      Returns a Table that is a merge of this table and the destination table. The rows that are in the destination table are included in this table. The tables must have
    • rangeSelect

      public final Table rangeSelect(Variable col_var, SelectableRange[] ranges)
      A single column range select on this table. This can often be solved very quickly especially if there is an index on the column. The SelectableRange array represents a set of ranges that are returned that meet the given criteria.
      Parameters:
      col_var - the column variable in this table (eg. Part.id)
      ranges - the normalized (no overlapping) set of ranges to find.
    • simpleSelect

      public final Table simpleSelect(QueryContext context, Variable lhs_var, Operator op, Expression rhs)
      A simple select on this table. We select against a column, with an Operator and a rhs Expression that is constant (only needs to be evaluated once).
      Parameters:
      context - the context of the query.
      lhs_var - the left has side column reference.
      op - the operator.
      rhs - the expression to select against (the expression must be a constant).
    • simpleJoin

      public final Table simpleJoin(QueryContext context, Table table, Variable lhs_var, Operator op, Expression rhs)
      A simple join operation. A simple join operation is one that has a single joining operator, a Variable on the lhs and a simple expression on the rhs that includes only columns in the rhs table. For example, 'id = part_id' or 'id == part_id * 2' or 'id == part_id + vendor_id * 2'

      It is important to understand how this algorithm works because all optimization of the expression must happen before the method starts.

      The simple join algorithm works as follows: Every row of the right hand side table 'table' is iterated through. The select opreation is applied to this table given the result evaluation. Each row that matches is included in the result table.

      For optimal performance, the expression should be arranged so that the rhs table is the smallest of the two tables (because we must iterate through all rows of this table). This table should be the largest.

    • exhaustiveSelect

      public final Table exhaustiveSelect(QueryContext context, Expression exp)
      Exhaustively searches through this table for rows that match the expression given. This is the slowest type of query and is not able to use any type of indexing.

      A QueryContext object is used for resolving sub-query plans. If there are no sub-query plans in the expression, this can safely be 'null'.

    • any

      public Table any(QueryContext context, Expression lhs, Operator op, Table right_table)
      Evaluates a non-correlated ANY type operator given the LHS expression, the RHS subquery and the ANY operator to use. For example;

         Table.col > ANY ( SELECT .... )
       

      ANY creates a new table that contains only the rows in this table that the expression and operator evaluate to true for any values in the given table.

      The IN operator can be represented by using '= ANY'.

      Note that unlike the other join and select methods in this object this will take a complex expression as the lhs provided all the Variable objects resolve to this table.

      Parameters:
      context - the context of the query.
      lhs - the left has side expression. The Variable objects in this expression must all reference columns in this table.
      op - the operator to use.
      right_table - the subquery table should only contain on column.
    • all

      public Table all(QueryContext context, Expression lhs, Operator op, Table table)
      Evaluates a non-correlated ALL type operator given the LHS expression, the RHS subquery and the ALL operator to use. For example;

         Table.col > ALL ( SELECT .... )
       

      ALL creates a new table that contains only the rows in this table that the expression and operator evaluate to true for all values in the giventable.

      The NOT IN operator can be represented by using 'invalid input: '<'> ALL'.

      Note that unlike the other join and select methods in this object this will take a complex expression as the lhs provided all the Variable objects resolve to this table.

      Parameters:
      context - The context of the query.
      lhs - the left has side expression. The Variable objects in this expression must all reference columns in this table.
      op - the operator to use.
      table - The subquery table should only contain on column.
    • join

      public final Table join(Table table)
      Performs a natural join of this table with the given table. This is the same as calling the above 'join' with no conditional.
    • outside

      public final VirtualTable outside(Table rtable)
      Finds all rows in this table that are 'outside' the result in the given table. This is used in OUTER JOIN's. We perform a normal join, then determine unmatched joins with this function. We can then create an OuterTable with this result to make the completed table.

      'rtable' must be a decendent of this table.

    • union

      public final Table union(Table table)
      Returns a new Table that is the union of the this table and the given table. A union operation will remove any duplicate rows.
    • distinct

      public final VirtualTable distinct()
      Deprecated.
      - not a proper SQL distinct.
      Returns a new table with any duplicate rows in this table removed.
    • distinct

      public final Table distinct(int[] col_map)
      Returns a new table that has only distinct rows in it. This is an expensive operation. We sort over all the columns, then iterate through the result taking out any duplicate rows.

      The int array contains the columns to make distinct over.

      NOTE: This will change the order of this table in the result.

    • columnContainsValue

      public final boolean columnContainsValue(int column, TObject ob)
      Returns true if the given column number contains the value given.
    • columnMatchesValue

      public final boolean columnMatchesValue(int column, Operator op, TObject ob)
      Returns true if the given column contains a value that the given operator returns true for with the given value.
    • allColumnMatchesValue

      public final boolean allColumnMatchesValue(int column, Operator op, TObject ob)
      Returns true if the given column contains all values that the given operator returns true for with the given value.
    • orderByColumns

      public final Table orderByColumns(int[] col_map)
      Returns a table that is ordered by the given column numbers. This can be used by various functions from grouping to distinction to ordering. Always sorted by ascending.
    • orderedRowList

      public final IntegerVector orderedRowList(int[] col_map)
      Returns an IntegerVector that represents the list of rows in this table in sorted order by the given column map.
    • orderByColumn

      public final VirtualTable orderByColumn(int col_index, boolean ascending)
      Returns a Table which is identical to this table, except it is sorted by the given column name. This means that if you access the rows sequentually you will be reading the sorted order of the column.
    • orderByColumn

      public final VirtualTable orderByColumn(Variable column, boolean ascending)
    • orderByColumn

      public final VirtualTable orderByColumn(Variable column)
    • getTableAccessState

      public final TableAccessState getTableAccessState()
      This returns an object that can only access the cells that are in this table, and has no other access to the 'Table' class's functionality. The purpose of this object is to provide a clean way to access the state of a table without being able to access any of the row sorting (SelectableScheme) methods that would return incorrect information in the situation where the table locks (via LockingMechanism) were removed. NOTE: The methods in this class will only work if this table has its rows locked via the 'lockRoot(int)' method.
    • selectAll

      public final IntegerVector selectAll(int column)
      Returns an array that represents the sorted order of this table by the given column number.
    • selectAll

      public final IntegerVector selectAll()
      Returns a list of rows that represents the enumerator order of this table.
    • selectRange

      public final IntegerVector selectRange(int column, SelectableRange[] ranges)
      Returns an array that represents the sorted order of this table of all values in the given SelectableRange objects of the given column index. If there is an index on the column, the result can be found very quickly. The range array must be normalized (no overlapping ranges).
    • selectLast

      public final IntegerVector selectLast(int column)
      Returns an array that represents the last sorted element(s) of the given column number.
    • selectFirst

      public final IntegerVector selectFirst(int column)
      Returns an array that represents the first sorted element(s) of the given column number.
    • selectRest

      public final IntegerVector selectRest(int column)
      Returns an array that represents the rest of the sorted element(s) of the given column number. (not the 'first' set).
    • getFirstCellContent

      public final TObject getFirstCellContent(int column)
      Returns the TObject value that represents the first item in the set or null if there are no items in the column set.
    • getFirstCellContent

      public final TObject[] getFirstCellContent(int[] col_map)
      Returns the TObject value that represents the first item in the set or null if there are no items in the column set.
    • getLastCellContent

      public final TObject getLastCellContent(int column)
      Returns the TObject value that represents the last item in the set or null if there are no items in the column set.
    • getLastCellContent

      public final TObject[] getLastCellContent(int[] col_map)
      Returns the TObject value that represents the last item in the set or null if there are no items in the column set.
    • getSingleCellContent

      public final TObject getSingleCellContent(int column)
      If the given column contains all items of the same value, this method returns the value. If it doesn't, or the column set is empty it returns null.
    • getSingleCellContent

      public final TObject[] getSingleCellContent(int[] col_map)
      If the given column contains all items of the same value, this method returns the value. If it doesn't, or the column set is empty it returns null.
    • columnContainsCell

      public final boolean columnContainsCell(int column, TObject cell)
      Returns true if the given cell is found in the table.
    • compareCells

      public static boolean compareCells(TObject ob1, TObject ob2, Operator op)
      Compares cell1 with cell2 and if the given operator evalutes to true then returns true, otherwise false.
    • toMap

      public Map toMap()
      Assuming this table is a 2 column key/value table, and the first column is a string, this will convert it into a map. The returned map can then be used to access values in the second column.
    • fastFindFieldName

      public final int fastFindFieldName(Variable col)
      A faster way to find a column index given a string column name. This caches column name -> column index in a HashMap.
    • toString

      public String toString()
      Returns a string that represents this table.
      Overrides:
      toString in class Object
    • printGraph

      public void printGraph(PrintStream out, int indent)
      Prints a graph of the table hierarchy to the stream.