| [ << ] | [ < ] | [ Up ] | [ > ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
22.3 Tables File Format
This section defines the file format of serialized flex tables.
The tables format allows for one or more sets of tables to be specified, where each set corresponds to a given scanner. Scanners are indexed by name, as described below. The file format is as follows:
TABLE SET 1
+-------------------------------+
Header | uint32 th_magic; |
| uint32 th_hsize; |
| uint32 th_ssize; |
| uint16 th_flags; |
| char th_version[]; |
| char th_name[]; |
| uint8 th_pad64[]; |
+-------------------------------+
Table 1 | uint16 td_id; |
| uint16 td_flags; |
| uint32 td_lolen; |
| uint32 td_hilen; |
| void td_data[]; |
| uint8 td_pad64[]; |
+-------------------------------+
Table 2 | |
. . .
. . .
. . .
. . .
Table n | |
+-------------------------------+
TABLE SET 2
.
.
.
TABLE SET N
The above diagram shows that a complete set of tables consists of a header followed by multiple individual tables. Furthermore, multiple complete sets may be present in the same file, each set with its own header and tables. The sets are contiguous in the file. The only way to know if another set follows is to check the next four bytes for the magic number (or check for EOF). The header and tables sections are padded to 64-bit boundaries. Below we describe each field in detail. This format does not specify how the scanner will expand the given data, i.e., data may be serialized as int8, but expanded to an int32 array at runtime. This is to reduce the size of the serialized data where possible. Remember, all integer values are in network byte order.
Fields of a table header:
th_magicMagic number, always 0xF13C57B1.
th_hsizeSize of this entire header, in bytes, including all fields plus any padding.
th_ssizeSize of this entire set, in bytes, including the header, all tables, plus any padding.
th_flagsBit flags for this table set. Currently unused.
th_version[]Flex version in NULL-terminated string format. e.g., ‘2.5.13a’. This is the version of flex that was used to create the serialized tables.
th_name[]Contains the name of this table set. The default is ‘yytables’, and is prefixed accordingly, e.g., ‘footables’. Must be NULL-terminated.
th_pad64[]Zero or more NULL bytes, padding the entire header to the next 64-bit boundary as calculated from the beginning of the header.
Fields of a table:
td_idSpecifies the table identifier. Possible values are:
YYTD_ID_ACCEPT (0x01)yy_acceptYYTD_ID_BASE (0x02)yy_baseYYTD_ID_CHK (0x03)yy_chkYYTD_ID_DEF (0x04)yy_defYYTD_ID_EC (0x05)yy_ecYYTD_ID_META (0x06)yy_metaYYTD_ID_NUL_TRANS (0x07)yy_NUL_transYYTD_ID_NXT (0x08)yy_nxt. This array may be two dimensional. See thetd_hilenfield below.YYTD_ID_RULE_CAN_MATCH_EOL (0x09)yy_rule_can_match_eolYYTD_ID_START_STATE_LIST (0x0A)yy_start_state_list. This array is handled specially because it is an array of pointers to structs. See thetd_flagsfield below.YYTD_ID_TRANSITION (0x0B)yy_transition. This array is handled specially because it is an array of structs. See thetd_lolenfield below.YYTD_ID_ACCLIST (0x0C)yy_acclist
td_flagsBit flags describing how to interpret the data in
td_data. The data arrays are one-dimensional by default, but may be two dimensional as specified in thetd_hilenfield.YYTD_DATA8 (0x01)The data is serialized as an array of type int8.
YYTD_DATA16 (0x02)The data is serialized as an array of type int16.
YYTD_DATA32 (0x04)The data is serialized as an array of type int32.
YYTD_PTRANS (0x08)The data is a list of indexes of entries in the expanded
yy_transitionarray. Each index should be expanded to a pointer to the corresponding entry in theyy_transitionarray. We count on the fact that theyy_transitionarray has already been seen.YYTD_STRUCT (0x10)The data is a list of yy_trans_info structs, each of which consists of two integers. There is no padding between struct elements or between structs. The type of each member is determined by the
YYTD_DATA*bits.
td_lolenSpecifies the number of elements in the lowest dimension array. If this is a one-dimensional array, then it is simply the number of elements in this array. The element size is determined by the
td_flagsfield.td_hilenIf
td_hilenis non-zero, then the data is a two-dimensional array. Otherwise, the data is a one-dimensional array.td_hilencontains the number of elements in the higher dimensional array, andtd_lolencontains the number of elements in the lowest dimension.Conceptually,
td_datais eithersometype td_data[td_lolen], orsometype td_data[td_hilen][td_lolen], wheresometypeis specified by thetd_flagsfield. It is possible for bothtd_lolenandtd_hilento be zero, in which casetd_datais a zero length array, and no data is loaded, i.e., this table is simply skipped. Flex does not currently generate tables of zero length.td_data[]The table data. This array may be a one- or two-dimensional array, of type
int8,int16,int32,struct yy_trans_info, orstruct yy_trans_info*, depending upon the values in thetd_flags,td_lolen, andtd_hilenfields.td_pad64[]Zero or more NULL bytes, padding the entire table to the next 64-bit boundary as calculated from the beginning of this table.
| [ << ] | [ < ] | [ Up ] | [ > ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
This document was generated on November 4, 2011 using texi2html 5.0.
