[Net 2000 Ltd. Home][DataBee Home][DataBee Manual][DataBee FAQ]

The DataBee Set Extractor
Misc. Setup Tab

The Misc. Setup Tab contains a few configuration items (locations of output and log files). These items are usually configured when the Extraction Set is loaded. In the event that the Extraction Set does not contain this information the configuration options can be manually updated. Most users generally prefer to make the changes in the Set Designer application and reload the Extraction Set.


The DataBee Set Extractor Misc. Setup Tab

Directory for Extracted Set File: The Set Extractor application uses the rules defined in the Extraction Set to identify and download the appropriate and relevant rows from each table in the source schema. It places this row information in a binary file called an Extracted Set File. This field lets you specify the directory into which the Extracted Set File should be written. The Extracted Set File directory information is generally configured in the Set Designer application - however it can be modified here by clicking on the Change button.

Note: Changes made to the Extracted Set File directory in the Set Extractor application are only temporary and will not be remembered after the Set Extractor application is closed. Use Set Designer application to permanently associate the new Extracted Set File directory with the Extraction Set.

Directory for Log Files: The Set Extractor application writes out a log of its progress as it extracts the data from the source schema. All DataBee applications must use the same Log File Directory. The Log File Directory setting in the Set Designer application always provides the location for the Set Extractor applications log directory. The Log File Directory cannot be modified here - change this value using the Set Designer and restart the Set Extractor application.

Extract Options
Use Read-Only Extract Mode: The Set Extractor can (optionally) perform extracts on read only databases. In READ-ONLY mode the DTB_RS_TMP temporary table is no longer required on the source database. Read-Only extracts are typically used in situations where the source database cannot be written to for some reason. An example of this may be when the source is an Oracle Stand-By database.

Enabling Read-Only mode does cause an increase in extract times. Tests indicate that extracts in READ-ONLY mode will typically take about three times longer than the same extract which uses the writeable temporary table. It is advised to leave READ-ONLY mode disabled unless you really need it.

The READ-ONLY option can be applied within the Set Designer and Set Extractor. If enabled within the Set Designer tool the READ-ONLY mode will be saved with the Extraction Set and passed to the Set Extractor as part of the Extraction Set configuration. Setting the READ-ONLY mode in the Set Extractor tool will cause it to activate only for the duration of the current extract. The Set Extractor cannot save the READ-ONLY mode state back to the Extraction Set.

What the buttons do
Reload Rules and Continue Extract
When developing a set of extraction rules a repeated cycle of "Add a Rule", "Run the Extract", "Add Another Rule" usually takes place. For larger extracts, a complete re-extraction to test a newly added rule can mean a considerable delay. Depending on the type of changes made to the rules it is not always necessary to completely re-extract. Under certain circumstances it is possible for the new rules to be reloaded and used with the currently extracted information. This continuation feature simulates the effect of a complete re-extract and dramatically speeds up the development process.

A rule re-load and continuation cannot happen with certain types of types of changes. The changes which will invalidate the option to reload and continue are:

  • Deletion of any rule.
  • Changing the WHERE clause, sample percentage or row limit options of WHERE rules.
  • Changing the columns, number of columns, or table information in WHERE or Table-to-Table rules.
  • Any sort of editing of an existing rule.
  • Disabling a rule.

The changes which can be made which will enable you to reload the rules and continue are:

  • Addition of rules.
  • Enabling a disabled rule.

If the rule changes are such that it is not possible to reload and continue, a dialog box will pop-up inform you of this fact. You will need to clear, reload the extraction set and perform a complete re-extract.

Send Extracted Row Counts
The Set Designer application displays the last Extracted Row count for each table in the Extraction Set in various dialog boxes. These counts greatly assist with the creation of new rules. By default, the extracted row counts for each table are only sent to the Set Designer application if the entire extraction successfully completes in the Set Extractor. This button will force the counts to be sent - even if the extract is in progress or if it was terminated before it could complete. This function is used only in development mode when working on extraction rules.

[Net 2000 Ltd. Home][DataBee Home][DataBee Manual][DataBee FAQ]