The Set Extractor Table Statistics tab is designed to provide information on the number of rows extracted for each table and the rate at which those rows have been extracted. The statistics are continuously updated during the extract operations and the count displayed on this tab is an accurate reflection of the number of rows that have been extracted in the source schema.
Only tables which have had data extracted by an extraction set run on a source schema by the Set Extractor application can be loaded by a loader set in the Set Loader application. For more information on the process of creating a referentially correct subset database please see the Overview of the Extract and Load Process help file.
A useful report detailing the contents of the Table Statistics tab can be generated using the Extraction Table Statistics Report button on the Misc. Setup tab of the Set Extractor application. This report can be auto-generated after each run of the extraction set using the Extraction Report Options sub-tab on the Set Designer Misc. Setup Tab.
The DataBee Set Extractor Table Statistics Tab
The panel in the the Table Statistics tab lists the tables known to the Rule Controller rules of the currently open extraction set. The panel is divided up into six vertical columns. These columns provide specific information on various statistics associated with the table. Clicking on the column heading sorts the display based on the information in that column.
What the columns in the Table Statistics Panel mean
- Tables in the Extraction Set
- This column shows table name. The schema owner of the table is determined by the Rule Controller to which it belongs. Because there can be more than one Rule Controller in an extraction set each table is displayed as a dependency below its Rule Controller.
- Src. Rows column
- This value is the estimated number of source rows in the table. This information is considered an estimate because it is not automatically updated to reflect any changes and the actual value can be different than the value recorded in the extraction set. The Set Extractor application does not need an accurate count of the rows in a table in order to function - it will always extract what is required. The Src. Row Count value is mostly used comparison purposes during the development of the extraction set. The source row counts can be refreshed by opening the extraction set in the Set Designer application and using the tools on the Extract Tools tab of the Edit Rule Controller form.
- Ext. Rows column
- This column shows the number of rows that were extracted for each table. This figure will be continuously updated as the extract progresses. Note that if the Src. Row Count estimate is inaccurate, it can sometimes appear as though more rows were extracted than are actually present. This is just an artifact of inaccuracy in the estimate of the total number of rows in the source table and not an actual error.
- Ext. Percent column
- This value is the number of extracted rows divided by the number of estimated source rows expressed as a percentage and serves to indicate the relative amount of rows extracted for that table.
- Matches Plan
- The Matches Plan column states how well the current extracted row count matches the goal specified in the Plan Type column. A value of Yes indicates that the current extraction state matches the plan and a value of No means the the current extracted row count does not match the desired plan goal.
- Plan Type
- The Plan Type column lists the Extraction Set Plan for that table. Planned extraction goals can be either Subset ([Subset]), All Rows ([all]), No Rows ([no rows]) or Unknown ([Unknown]). Note that although the Plan Type for a table can be changed in the Set Extractor application the resulting changes cannot be saved back to the extraction set. The Set Extractor application has no ability to save an extraction set. In order to permanently save a Plan Type change with an extraction set open the set in the Set Designer application and make the changes there.