New Features in Version 5.0
New in This Release
Vector 5.0 contains the following new features:
• UUID data type and functions: UUIDs can be generated automatically when inserting data. A UUID can be used as a primary key, as a partition key, or as both; used as a partition key, it helps spread data evenly across nodes.
• Clonedb utility: Lets you copy a database from one Vector instance to another, for example, between installations, machines, or clusters. A typical use is cloning a production database for testing purposes.
• SQL syntax for parallel vwload (COPY table() VWLOAD FROM 'file1', 'file2',…): Performs the same operation as running vwload -c from the command line. Because the load is issued through SQL, the vwload operation can be part of a larger transaction. Using a single transaction avoids the overhead of committing separate transactions and writing each one to disk. This is especially useful when loading data to apply updates.
• SQL syntax for CSV export (INSERT INTO EXTERNAL CSV 'filename'...): Writes a table to the local file system. The result is a single CSV file if the query runs serially, or a collection of CSV files if the query runs in parallel.
• The Spark-Vector Connector has been enhanced to provide parallel unload.
• Distributed Write Ahead Log: The single LOG file has been split into multiple files stored in the wal directory. This feature improves performance, especially for large data sizes.
• Automatic histogram generation: You do not have to generate statistics yourself for proper query execution, which gives you more flexibility in managing statistics. A histogram is generated automatically for every column that appears in a WHERE clause and does not already have a histogram stored in the catalog. The histograms are built from sample data that the min-max indexes maintain in memory.
• SET SERVER_TRACE and SET SESSION_TRACE statements: Allow tracing of all queries processed by the DBMS Server regardless of their source, whether a query is interactive or arrives through a JDBC, ODBC, or .NET connection.
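
As an illustration of the UUID item above, the following Python sketch shows why randomly generated UUIDs tend to spread evenly when used as a partition key. It is not Vector code. The function names partition_for and distribution are hypothetical, and the modulo placement rule is an assumption standing in for whatever hash partitioning Vector applies internally.

```python
import uuid
from collections import Counter


def partition_for(key: uuid.UUID, num_partitions: int) -> int:
    """Assumed placement rule: the UUID's 128-bit integer value modulo the partition count."""
    return key.int % num_partitions


def distribution(num_rows: int, num_partitions: int) -> Counter:
    """Generate num_rows random (version 4) UUIDs and count how many land in each partition."""
    counts = Counter()
    for _ in range(num_rows):
        counts[partition_for(uuid.uuid4(), num_partitions)] += 1
    return counts


if __name__ == "__main__":
    # Print the row count per partition for 100,000 generated keys over 8 partitions.
    counts = distribution(100_000, 8)
    for partition in sorted(counts):
        print(partition, counts[partition])
```

Because the keys are random, each partition should receive close to the same number of rows. A sequential or low-cardinality key gives no such guarantee under the same placement rule.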