CiNA Public Use Data Set

CiNA Public Use Data Set

The NAACCR CiNA Public Use Research File is a publicly accessible dataset requiring only a signed Data Assurance Agreement for access. CiNA Public Use is a non-confidential dataset with a limited set of variables, accessed through the SEER*Stat software. This dataset has the most complete North American coverage of any other publicly available cancer dataset. CiNA Public Use allows a user to generate counts, rates and trends within SEER*Stat.

CiNA Public Use datasets are updated annually after publication of the CiNA Explorer statistics and are typically released in July. At that time, CiNA Public Use users will have their access removed automatically from any older Public Use datasets and can request access to the new dataset via DaRT.

This dataset includes age in both the 20 and 21 age group categories for diagnosis year 2010 forward. For diagnosis years prior to 2010, a historic dataset with 19 age groups is also available.

  • Access to these data is open to the public for research or fact finding.

  • Researchers may request access to the CiNA Public Use Research File by submitting an application to NAACCR via DaRT.

The variable list for the CiNA Public Use Research File and all other CiNA Research Files are available here: Variable list

 

Citation

Please reference the CiNA Public Use Research File as indicated in the SEER*Stat session. For example: SEER*Stat Database: NAACCR Incidence Data – CiNA Research Data, 2010-2022, Public Use (20 Age Groups) (which includes data from CDC’s National Program of Cancer Registries (NPCR), CCR’s Provincial and Territorial Registries, and the NCI’s Surveillance, Epidemiology and End Results (SEER) Registries), certified by the North American Association of Central Cancer Registries (NAACCR) as meeting high-quality incidence data standards for the specified time periods, submitted December 2024.

 

Small Numbers

  • To preserve confidentiality of the data, data are automatically suppressed for counts less than 6 based on potentially linkable variables (registry, sex, age, race, race/ethnicity, year of diagnosis and site).
  • For statistical stability, we advise caution in interpreting rates and other results based on fewer than 25 cases.

 

SEER*Stat

The CiNA Public Use Research File is distributed within SEER*Stat. SEER*Stat statistical software is a standard tool for analysis of cancer-related data. Additional information on SEER*Stat is available on the NCI, SEER site: http://seer.cancer.gov/seerstat/ . Tutorials are available here: http://seer.cancer.gov/seerstat/tutorials/.

Delay factors, survival statistics, and prevalence are not available in the CiNA Public Use Research File To be included in the CiNA Public Use Research File, data from a central registry from the US or Canada must meet specific data quality standards. All Gold and Silver NAACCR-certified central registries are eligible for inclusion in the CiNA Public Use Research File. Each central registry must also consent to the use of their data in the CiNA Public dataset. A current list of certified registries is available here: https://www.naaccr.org/certified-registries/. Registries may not have been certified in prior years, but if their data quality improves over time, their data are included in CiNA. However, not all states meet the data quality criteria for each year and will have zero counts for those years. Please review the spreadsheet Registry Data Fitness for Use By Data Year available here: https://www.naaccr.org/cina-data-products-overview/.

 

Data Collection

Cancer registry data are collected in an on-going, systematic, and standardized process. In Canada, the cancer registry collection program is overseen by the Canadian Council 3 of Cancer Registries. In the US, there are two cancer registry collection programs—the National Cancer Institute’s Surveillance, Epidemiology and End Results (SEER Program) and the Center for Disease Control’s National Program of Cancer Registries (NPCR). Data for all three programs is collected in a coordinated process from hospitals and other medical facilities, including inpatient, outpatient, and standalone facilities. The data are collected or overseen by certified tumor registrars, who are highly trained medical professionals, to ensure complete and high-quality data collection.

Detailed coding information is available in the NAACCR Data Standards & Data Dictionary available here: https://apps.naaccr.org/data-dictionary/.  Please note, the variables available in the CiNA Public Use Research File are a subset of the full variable list collected. Many variables in the CiNA Public Use Research File are aggregated or recoded for ease of analysis and may not match the original variable codes.

Copyright © 2018 NAACCR, Inc. All Rights Reserved | naaccr-swoosh-only See NAACCR Partners and Sponsors