Site Map

DataBee creates referentially correct cut-down databases.



Key Features Icon


A summary of the DataBee software.

Overview Icon


A technical overview of how DataBee extracts and loads data subsets.



White Paper Icon


A comprehensive survey of the techniques and issues you need to know about before you begin data subsetting.



Download Icon


A 30 day, fully functional, evaluation copy of the DataBee software.



Quick Start Icon


A quick start to the DataBee software.

FAQ Icon


Answers to frequently asked questions about the DataBee software.

System Reqirements Icon


The hardware & software supported by DataBee.


DataBee Overview


DataBee consists of three applications - each of which is dedicated to a particular function. The division of the operational responsibilities between the three components (as opposed to using just one) makes the software easier to use.


Separating the DataBee software into several smaller modules rather than placing all of the functionality into a single larger application minimizes the number of decisions that you, the user, must make as the software is used. The architecture of the problem lends itself to such a solution. At any time in the process the requirement is either designing rules, extracting data or loading data. These are very separate actions and there is no need to have other functionality co-located in the same application. To do so would add complexity and present an array of irrelevant visual information. To put it bluntly, using separate applications reduces the learning curve because there are fewer decisions to be made. In any one DataBee application everything visible is always relevant to the task at hand.


The DataBee applications install on a Windows PC and connect to both the Oracle source and target databases via the Oracle standard TNSNames over SQL*Net method. The DataBee software acts as a manager for the extract and load process but the data operations always occur remotely. At no time does the data in the database travel from the source or target schemas to the PC on which the DataBee applications are installed. It is possible for the source and target schemas to be in databases created on completely different operating systems and they can also be in different Oracle database versions.


The DataBee Applications:

FAQ IconSet Designer

In order to extract rows from the source database, the DataBee software needs rules which indicate how the tables are related to each other. When loading a target schema with data the DataBee software needs to know which tables to load, and any possible column and data modifications which may be required. The Set Designer application contains tools to help build the collection of connection information, rules and table structure information required for an extract (called an Extraction Set) or for load operations (called a Loader Set). The Set Designer can discover and automatically add rules for object and relationships contained within the database as well as enable the manual addition, deletion and editing of rules.


FAQ IconSet Extractor

The Set Extractor application takes the Extraction Set created by the Set Designer and identifies the rows in the source Oracle database which will be required to populate the target database. Essentially the Set Extractor is a ROWID Hunter - all it does is identify the collection of rows (by ROWID) in the source schema which form a referentially correct subset. The output of the Set Extractor is a list of ROWID values in a temporary table stored in the source schema (or proxy schema). Although the Set Extractor is installed on a Windows PC, the Oracle schema being extracted can be located in a remote Oracle database on any operating system. All extraction operations take place on the remote database and the extracted ROWIDs are never transmitted to the PC running the Set Extractor application.


FAQ IconSet Loader

The Set Loader application uses a Loader Set (also created by the Set Designer application) and populates the target schema. There are two methods of loading: load via database link and load via direct copy. When loading via a database link, the target schema can be located on any Oracle database. The data is pulled through a database link which runs directly from the target schema into the source schema. When loading in direct copy mode, the target schema must be located in the same database as the source schema and it must have read permissions on the tables in the source schema. In both cases, the Set Loader application reads the ROWID values placed in the source schema temporary table by the Set Extractor application. The data is always transferred by direct transfer between the source and target schema and never travels down to the PC running the Set Loader application. Once the Set Loader has finished, the tables in the target database will be populated with a referentially correct subset of the data currently in the source database.


Further Information:

  • An overview of the subset extraction and load process.
  • A discussion of Extraction Sets.
  • A discussion of Loader Sets.
  • The DataBee Quick Start Training.