gui.help.html Maven / Gradle / Ivy
Go to download
Show more of this group Show more artifacts with this name
Show all versions of data-generator Show documentation
Show all versions of data-generator Show documentation
This Java library can be used to create synthetic data files.
The newest version!
NAACCR XML Utility Help
Synthetic Data File Generator
Table of Contents
Introduction
This tool can be used to create synthetic (fabricated) NAACCR data files.
To achieve that goal, the tool uses a library that generates random data based on frequencies extracted from the SEER data.
Creating Random Data
The library uses three ways to assign values:
- Constant values: the rule always assigns the same value to the field.
- Random values from a list: the rule assigns a random value from a specific list of values.
- Random values based on a frequency: the rule uses a frequency (usually from a CSV file) to get the value to assign; this results in more common values being assigned more often.
In addition to those assignment mechanisms, each rule might have dependencies to the values assigned by previous rules.
About this tool
This tool was developed by Information Management Services, Inc. under the SEER program of the Surveillance Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute.
© 2015 - 2024 Weber Informatics LLC | Privacy Policy