Creating a Data File

The transactional interface gives developers tremendous flexibility in optimizing database applications. In providing such flexibility, the transactional interface exposes a great deal of the inner workings of the transactional database engine. If you are new to the transactional interface, the Create (14) operation may appear quite complex to you, but you do not need all of the features this operation provides to get started. This section highlights the basic requirements by stepping you through the creation of a simple, transaction-based data file. For simplification where necessary, this section uses C interface terminology.

Note: In the same directory, no two files should share the same file name and differ only in their file name extension. For example, do not name a data file Invoice.btr and another one Invoice.mkd in the same directory. This restriction applies because the database engine uses the file name for various areas of functionality while ignoring the file name extension. Since only the file name is used to differentiate files, files that differ only in their file name extension look identical to the database engine.

Data Layout

This section uses an example data file that stores employee records. The application will retrieve employee information by providing either a unique employee ID or the employee’s last name. Because more than one employee can have the same last name, the database will allow duplicate values for the last name. Based on these criteria, the data layout for the file is as follows:


Information in Record	Data Type	Key/Index Characteristics
Last name	25 character string	Duplicatable
First name	25 character string	None
Middle initial	1 character string	None
Employee ID	4 byte integer	Unique
Phone number	13 character string	None
Pay rate per month	4 byte float	None

Now that the basic data layout is established, you can begin applying the terminology and requirements of the transactional interface. This includes determining information about the key structure and file structure before you actually create the file. You must work out these details in advance, because the Create (14) operation creates the file, index, and key information all at once. The following sections discuss the issues to consider in working out these details.

Key Attributes

First, determine any special functionality for the keys. The transactional database engine supports a variety of key attributes you can assign, as shown in the following table.

Table 13

Extended Data Type. Stores a transactional interface data type other than string or unsigned binary. Use this attribute, rather than the standard binary data type. This key attribute can accommodate the standard binary and string data types, plus many others.

BIN

Standard BINARY Data Type. Supported for historical reasons. Stores an unsigned binary number. Default data type is string.

DUP

Linked Duplicates. Allows duplicate values, which are linked by pointers from the index page to the data page. For more information, refer to Duplicatable Keys.

REPEAT_DUPS_KEY

Repeating Duplicates. Allows duplicate values, which are stored on both the index page and the data page. For more information, refer to Duplicatable Keys.

MOD

Modifiable. Allows the key value to be modified after the record is inserted.

SEG

Segmented. Specifies that this key has a segment that follows the current key segment.

NUL

Null Key (All Segments). Excludes any records from the index if all segments of the key contain a specified null value. (You assign the null value when you create the file.)

MANUAL_KEY

Null Key (Any Segment). Excludes any records from the index if any segment in the key contains a specified null value. (You assign the null value when you create the file.)

DESC_KEY

Descending Sort Order. Orders the key values in descending order (highest to lowest). Default is ascending order (lowest to highest).

NOCASE_KEY

Case Insensitive. Sorts string values without distinguishing upper and lower case letters. Do not use if the key has an alternate collating sequence (ACS). In the case of a Null Indicator segment, this attribute is overloaded to indicate that non-zero null values should be treated distinctly.

ALT

Alternate Collating Sequence. Uses an ACS to sort string keys differently from the standard ASCII collating sequence. Different keys can use different ACSs. You can specify the default ACS (the first one defined in the file), a numbered ACS defined in the file, or a named ACS defined in the COLLATE.CFG system file.

NUMBERED_ACS

NAMED_ACS

For simplicity, these constants, defined in btrconst.h, are consistent with the C interface. Some interfaces may use other names or no constants at all. For bit masks, hexadecimal, and decimal equivalents for the key attributes, refer to the Btrieve API Guide.

You assign these key attributes for each key you define. Each key has its own key specification. If the key has multiple segments, you have to provide the specification for each segment. Some of these attributes can have different values for different segments within the same key. Using the previous example, the keys are the last name and the employee ID. Both keys use extended types; the last name is a string and the employee ID is an integer. Both are modifiable, but only the last name is duplicatable. In addition, the last name is case insensitive.

Regarding the data type you assign to a key, the transactional interface does not validate that the records you input adhere to the data types defined for the keys. For example, you could define a TIMESTAMP key in a file, but store a character string there or define a date key and store a value for February 30. Your transactional interface application would work fine, but an ODBC application that tries to access the same data might fail, because the byte format could be different and the algorithms used to generate the timestamp value could be different. For complete descriptions of the data types, refer to the SQL Engine Reference.

File Attributes

Next, determine any special functionality for the file.

The transactional database engine supports a variety of file attributes you can assign, as follows:

Table 14

Variable Length Records. Use in files that contain variable length records.

BLANK_TRUNC

Blank Truncation. Conserves disk space by dropping any trailing blanks in the variable-length portion of the record. Applicable only to files that allow variable-length records and that do not use data compression. For more information, refer to Blank Truncation.

PRE_ALLOC

Page Preallocation. Reserves contiguous disk space for use by the file as it is populated. Can speed up file operations if a file occupies a contiguous area on the disk. The increase in speed is most noticeable on very large files. For more information, refer to Page Preallocation.

DATA_COMP

Data Compression. Compresses records before inserting or updating them and uncompresses records when retrieving them. For more information, refer to Record Compression.

KEY_ONLY

Key-Only File. Includes only one key, and the entire record is stored with the key, so no data pages are required. Key-only files are useful when your records contain a single key and that key takes up most of each record. For more information, refer to Key-Only Files.

BALANCED_KEYS

Index Balancing. Rotates values from full index pages onto index pages that have space available. Index balancing enhances performance during read operations, but may require extra time during write operations. For more information, refer to Index Balancing.

FREE_10
FREE_20
FREE_30

Free Space Threshold. Sets the threshold percentage for reusing disk space made available by deletions of variable length records, thus eliminating the need to reorganize files and reducing the fragmentation of variable-length records across several pages.

A larger Free Space Threshold reduces fragmentation of the variable-length portion of records which increases performance. However, it requires more disk space. If higher performance is desired, increase the Free Space Threshold to 30 percent.

DUP_PTRS

Reserve Duplicate Pointers. Preallocates pointer space for linked duplicatable keys added in the future. If no duplicate pointers are available for creating a linked-duplicatable key, the transactional database engine creates a repeating-duplicatable key.

INCLUDE_SYSTEM_DATA

System Data. Includes system data upon file creation, which allows the transactional database engine to perform transaction logging on the file. This is useful in files that do not contain a unique key.

NO_INCLUDE_SYSTEM_DATA

SPECIFY_KEY_NUMS

Key Number. Allows you to assign a specific number to a key, rather than letting the transactional database engine assign numbers automatically. Some applications may require a specific key number.

VATS_SUPPORT

Variable-tail Allocation Tables (VATs). Uses VATs (arrays of pointers to the variable-length portion of the record) to accelerate random access and to limit the size of the compression buffer used during data compression. For more information, refer to Variable-tail Allocation Tables.

The example data file does not use any of these file attributes, because the records are fixed-length records of small size.

For definitions of file attributes, refer to File Types. For more information about specifying file attributes during the Create operation, refer to the Btrieve API Guide.

Creating File and Key Specification Structures

When you use the Create operation, you pass in a data buffer that contains file and key specification structures. The following structure uses the example employee data file.

Table 15

Sample Data Buffer for File and Key Specifications

Logical Fixed Record Length. (Size of all fields combined: 25 + 25 + 1 + 4 + 13 + 4). For instructions, refer to Calculating the Logical Record Length.3

512

A minimum size of 4096 bytes works best for most files. If you want to fine-tune this, refer to Choosing a Page Size for more information.

6.0 to 8.0 file formats support page sizes of 512 times x, where x is any number up to the product 4,096.

9.0 file format supports page sizes identical to previous versions except that it also supports a page size of 8,192.

9.5 file format supports page sizes of 1,024 times 2 0 thru 4.

When creating 9.5 format files, if the logical page size specified is valid for the file format, the MicroKernel rounds the specified value to the next higher valid value if one exists. For all other values and file formats, the operation fails with status 24. No rounding is done for the older file formats.

Number of Keys. (Number of keys in the file: 2)

Use database engine default

Reserved. (Not used during a Create operation.)

Reserved

6- 9

File Flags. Specifies the file attributes. The example file does not use any.

Short Int

10, 11

Number of Extra Pointers. Sets the number of duplicate pointers to reserve for future key additions. Used if the file attributes specify Reserve Duplicate Pointers.

Byte

Reserved. (Not used during a Create operation.)

Reserved

Preallocated Pages. Sets the number of pages to preallocate. Used if the file attributes specify Page Preallocation.

Short Int

14, 15

Key Specification for Key 0 (Last Name)

Key Position. Provides the position of the first byte of the key within the record. The first byte in the record is 1.

Short Int

16, 17

Key Length. Specifies the length of the key, in bytes.

Short Int

18, 19

Key Flags. Specifies the key attributes.

Short Int

20, 21

EXTTYPE_KEY + NOCASE_KEY + DUP + MOD

Not Used for a Create.

Byte

22-25

Extended Key Type. Used if the key flags specify Use Extended Key Type. Specifies one of the extended data types.

Byte

ZSTRING

Null Value (legacy nulls only). Used if the key flags specify Null Key (All Segments) or Null Key (Any Segment). Specifies an exclusion value for the key. See Null Value for more conceptual information on legacy nulls and true nulls.

Byte

Not Used for a Create.

Byte

28, 29

Manually Assigned Key Number. Used if the file attributes specify Key Number. Assigns a key number.

Byte

ACS Number. Used if the key flags specify Use Default ACS, Use Numbered ACS in File, or Use Named ACS. Specifies the ACS number to use.

Byte

Key Specification for Key 1 (Employee ID)

Key Position. (Employee ID starts at first byte after Middle Initial.)

Not Used for a Create.

Not Used for a Create.

Byte

44, 45

Manually Assigned Key Number.

Byte

ACS Number.

Byte

Key Specification for Page Compression

Physical Page Size5

Char

512
(default value)

1Unless specified otherwise, all data types are unsigned.

2For simplification, the non-numeric example values are for C applications.

3For files with variable-length records, the logical record length refers only to the fixed-length portion of the record.

4Short Integers (Short Int) must be stored in the “Little Endian” byte order, which is the Low To High ordering of Intel-class computers.

5Only used with page level compression. Must be used in conjunction with the Page Compression file flag (see Table 6). See also Creating a File with Page Level Compression for more information.

Creating a File with Page Level Compression

For Pervasive PSQL 9.5 and later, you can use the Create operation to create data files with page level compression. For earlier data files, logical pages map to physical pages, and this mapping is stored in a Page Allocation Table (PAT). A physical page is exactly the same size as a logical page.

When a file is compressed, each logical page is compressed into one or more physical page units that are smaller in size than a logical page. The physical page size is specified by the Physical Page Size attribute (see Table 15).

The Page Compression file flag (see Table 6) is used in conjunction with the Physical Page Size key specification to tell the MicroKernel to create the new data file with page level compression turned on. The logical and physical page sizes are validated as follows:

The value specified for the physical page size cannot be larger than the value specified for the logical page size. If it is then the MicroKernel will round down the value specified for the physical page size so that it is the same as the logical page size. The logical page size needs to be an exact multiple of the physical page size. If it is not then the logical page size is rounded down so that it becomes an exact multiple of the physical page size. If, as a result of these manipulations, the logical and physical values end up to be the same, then page level compression will not turned on for this file.

Calling the Create Operation

The Create operation (14) requires the following values:

•

Operation Code, which is 14 for a Create.

•

Data Buffer containing the file and key specifications.

•

Length of the Data Buffer.

•

Key Buffer containing the full path for the file.

•