Skip to content
NAACCR | North American Association of Central Cancer Registries
MyNAACCR
Learn More
NetLink
Contact Us
Site Map
Search
Text Size
Decrease
S
M
L
XL
Increase
About NAACCR
Bylaws
Executive Office and Staff
Gifts, Donations, and Bequests
NAACCR Board of Directors
NAACCR Business Meetings
Narrative Newsletter
Policies
Press Releases
Special Projects We Are Working On
Sponsoring Member Organizations
Standing Rules
Strategic Management Plan
The Annual Report and 990 Tax Forms
The NAACCR Mission
Certification
Criteria
Certification Levels
Logos
Who is Certified
Data and Publications
Annual Report to the Nation
Call for Data
Canadian Cancer Statistics
Cancer in North America (CINA) Publications
Data Quality Assessments and Evaluations
Epidemiologic Reports
NAACCR Annotated Bibliography on Confidentiality in Data Release
Request Publications
Top 5 Cancers in the U.S.
Education and Training
Annual Conference
Cancer Registry and Surveillance Webinar Series
CTR Preparation and Review Webinar Series
Mentor Fellowship Program
Process Improvement Program
Recruitment Materials
Resources & Train. Progs. for CTRs
Town Hall Webinars
Membership
Awards Program
Become a NAACCR Member
Committees
Membership Communications
Membership Directory
MyNAACCR and Membership Directory FAQ
View or Post Job Opportunities
Research
CINA Data Products Overview
CINA Deluxe Analytic File
CINA Monograph In SAS
CINA+ in SEER*Stat
CINA+ Online
Confidentiality Issues
Data Analysis Tools
GIS Resources
HIPAA Information
IRB Information
Research Capabilities by Registry
Shortest Path Finder Tool
Standards and Registry Operations
Add/Change Data Items in Volume II
Add/Change Standards in Volume III
Auto. Tumor Linkage Group Documents
CCR Calendar of Operations
CRC Checksum
Implementation Guidelines
Interoperability Resources
National Interstate Data Exchange Agreement
NAACCR v12 Translation Tools
Path Lab Search Terms
Record Consolidation
Reg. Operations Guidelines
Security Showcase
Volume I, Data Exchange Standards and Record Description
Volume II, Data Standards and Data Dictionary
Volume III, Standards for Completeness, Quality, Analysis, Management, Security, and Confidentiality of Data
Volume IV, Standard Data EDITS
Volume V, Pathology Laboratory Electronic Reporting
← Back
Computational Aspects of Statistical Confidentiality (CASC) Project
At-a-glance
Application
Level
Registry Users
Customers Users
Tools
Intermediate
Steward
Research/Surveillance
Research
Link:
http://neon.vb.cbs.nl/CASC/
Overview:
The CENEX-SDC (CENters for EXcellence Statistical Disclosure Control) Project is sponsored by the European community and the CASC project is a CENEX project to research, develop and implement new techniques for statistical disclosure control. The project, based in the Netherlands and with participants from Spain, the U.K., and Germany, has developed two software tools, known collectively as the Argus Twins and named μ-Argus and τ-Argus respectively, that work to protect both microdata and tabular data. Latest versions and manuals of both software products are freely available from the CASC website.
The software utilizes many techniques to prevent data intruders from re-identifying individuals and I think that it is well worth recommending that cancer registries be aware of this group (CENEX-SDC).
1. μ-Argus In the case of microdata, where each record represents an individual, the danger facing statistical data agencies is that individuals from whom statistical data has been collected, may be re-identified by data intruders.
For example, a data file in the Netherlands might contain this information: residence=Urk, gender=female, and occupation=statistician. Urk happens to be a small village and there might be only one female statistician living there, so that person may easily be re-identified through use of just these three key fields.
In order to create a safe public microdata file the data file must be modified. The techniques applied by the μ-Argus software include global recoding (or regrouping of categorical data), local suppression (in the example just given, the occupation of the woman might be suppressed, Post Randomization Method or PRAM, which deliberately misclassifies some categorical data based on a probabilistic model, and/or noise may be introduced into the data.
The μ-Argus manual includes a tutorial with a sample data file to guide the new user through the application. An ASCII fixed file is used as the input file along with a file of metadata, which contains the record layout for Argus to use. The manual also includes extensive discussion of the theory of SDC.
[See also the
NAACCR Record Uniqueness Program
entry]
2. τ-Argus For tabular data, the τ-Argus software helps identify cells that might reveal information on individuals and produces safe tables that mask an individuals identity from third party viewers. The program uses a dominance rule to find sensitive cells within a table. The dominance rule states that a cell of a table is unsafe for public use if a few (n) major contributors to a cell are responsible for a large percentage (k) of the total of that particular cell. The table creator may supply their own levels, but n=3 and k=70% is commonly used. Cells that are risky or unsafe are suppressed.
But due to the presence of marginals in a table it is often easy to recalculate these suppressed cells. So additional cells must be suppressed to prevent this recalculation of the primary unsafe cells. It is not only enough to prevent exact recalculation but also to guarantee a safety range to protect the primary unsafe cells. The optimal selection of these secondary cells, as to avoid unnecessary high losses in the information content of the protected tables, is a very complex numerical optimization problem. It cannot be applied to many tables if the tables have a hierarchical structure. The hierarchical structures imply many more (sub-)marginals, which can be used to recalculate these primary suppressed cells.
The τ-Argus manual contains chapters describing the theoretical background, a reference chapter, and a step-by-step set of procedures for users who want to process a table or set of tables. The program can also be used to process a microdata file from which tables can then be constructed.
Search Site
Search Publications
Search