Biorepository and Database

Select the area of interest for details about the CARET Biorepository and Database.

Specimens

Collection

The CARET Biorepository includes biologic material from the 18,314 CARET participants. Participants enrolled in the two pilot studies which preceded CARET (all of whom remain part of CARET) had blood collected at the first visit, the 4th-month visit, and each annual visit. Participants enrolled after the pilot had blood collected at the first visit and at every other annual visit. Blood draws ended in 1997, one year after the CARET intervention had been stopped. Serum samples were collected at all draws. Plasma collection occurred throughout the duration of the pilot studies (1985 – 1988), and continued at Vanguard/Efficacy visits through February 1990. Collection of whole blood and dried blood spots was attempted at a single time point for each participant during visits conducted from 1994-1997. The Specimen Collection Timeline (PDF) shows the type, quantity, and timing of blood specimen collection.

Tumor and normal tissue, in the form of pathology slides and paraffin blocks, was collected from 1995 to 1997 and from 1999 to 2003. However, collection was both prospective and retrospective; attempts to obtain tissue were made for all confirmed lung cancer cases reported by June 2003.

Storage
Blood aliquots were stored at -20°C at the study centers immediately after the draw. Within two weeks of collection, samples were shipped on dry ice to the Coordinating Center, where they entered long-term storage at -70°C.

Availability

Summaries of CARET specimens available for collaborative research are provided in the links below. Requests for additional information about the CARET biorepository and specimen bank, including detailed reports to support development of study proposals, may be sent to CARET@fredhutch.org.

CARET Blood Specimen Summary
CARET Tissue Specimen Summary

Endpoints

Collection and Adjudication

The endpoints review process has undergone several procedural modifications over the 20-year history of CARET. During the intervention phase and continuing through the first year of post-intervention follow-up (1985 - 1997), the procedure was as follows. We obtained medical records and pathology reports for all participants who reported a cancer diagnosis. We also requested slides or tissue blocks for lung, mesothelioma, and unknown primaries for central pathology review. Pathology reports were reviewed for all other cancers. Endpoint materials were reviewed independently by three reviewers from CARET's Endpoints Review Committee (ERC). If there was disagreement among the reviewers as to the primary site, histology, or date of diagnosis, the case was reviewed and closed during a meeting of the ERC. Prior to 1995, tissue specimens were returned after central pathology review.
From March 1997 through September 1998, a streamlined endpoint review and closure process was followed. Three changes were implemented for this period: 1) a single reviewer adjudicated all cases independently, unless questions arose; 2) for all cancers other than lung cancer, only pathology reports were required to confirm the endpoint; and 3) only the death certificate was required to confirm cause of death. The process was further streamlined in October 1998, allowing for review by endpoint specialists from the CARET Coordinating Center staff and a single reviewer from the ERC.

The final change during the active follow-up phase came in October 2003 when, as a cost-saving measure, we eliminated the collection and review of pathology reports and began relying solely on self-reports of cancer. Our decision to make this change was based on an evaluation of the accuracy of participant self-report of cancer, especially lung cancer, our primary endpoint. We examined the adjudication outcome of participant reported cancers and found that 91% of cases reported as lung cancer were closed as such. Most of the discrepancies in reporting were due to reporting recurrences or reporting metastases as new primaries. The confirmation rates were high for other commonly reported cancers as well, including prostate (96%), breast (94%), bladder (97%), and colorectal (88%). These findings assured us that self-report of cancer was an acceptable procedure. Based on this evaluation, we made several modifications to the data collection questionnaire to address the reasons for self-reported diagnoses with the review of medical records. In the majority of cases, this was concerned with differentiating a new primary cancer from recurrent, metastatic, and benign diseases. Thus, actual rates of agreement during the self-reporting period (October 2003 - June 2005) are likely higher than we observed in our evaluation.

State cancer registry and National Death Index (NDI) linkages were conducted in 2015 to extend follow up for endpoints. NDI linkage was complete through 2013; state cancer registry linkage was through 2012 for WA and CA, and through 2013 for CT. Cancer registries of the five states in which CARET enrollments occurred were targeted for the linkage, with approval obtained for three registries: California Cancer Registry, Connecticut Tumor Registry, and Washington State Cancer Registry. Regulatory guidelines in Maryland and Oregon at present prohibit access to cancer registry data for the aims of the current CARET grant. Of the 18,314 total CARET participants, 12,927 (71%) were enrolled by a study center located in CA, CT, or WA; and another 1,109 (6%) enrolled by other study centers had a history of residence in one of the three states. In total, records from 14,036 (77%) participants were included in the cancer registry linkages. Probabilistic software was used to match CARET records to cancer registry records based on data elements common to both repositories, including full social security number, name, sex, race, and date of birth (which are complete for 99% of CARET participants). Linkage results were classified as matches to previously reported CARET endpoints or new endpoints based on degree of matching between common data elements (e.g., date of diagnosis, cancer site, and histologic type) and manual review of pathology reports and other medical records collected during active follow up. The registry data were also used to adjudicate cases closed previously based on self-report only.

NDI linkage was performed to ascertain vital status and underlying cause of death on a total of 13,830 participants who were alive at the end of active follow-up or for whom report of death had not been confirmed by death certificate. Matches were determined on the basis of the NDI probabilistic score and classification group assignment, electronic evaluation of CARET data elements not incorporated in the NDI algorithms (e.g., date of last contact), and manual review of records. Discrepancies in cause of death and cancer diagnoses between NDI and cancer registry records were reviewed as part of the adjudication process.

Cancer Incidence and Mortality

Number of participants with cancer endpoints by cancer type and intervention arm assignment
Number of deceased participants by cause of death and intervention arm assignment

Cancer Staging

Medical records obtained for adjudication were reviewed subsequently to determine TNM staging on the four most common CARET cancer endpoints: lung, prostate, breast, and colorectal cancer. Extensive medical records, including pathology reports, surgical reports, hospital records, and scan and radiograph reports, as well as clinical notes by the medical personnel attending the patient, were obtained for most cancers. Information was sufficient to determine staging on 75% of cases across the four cancer types reviewed. Almost 50% of cases that could not be staged were cancers reported during the self-reporting period (October 2003 - June 2005), after the collection of medical records was discontinued. Staging summary data, updated in November 2015, are provided in the links below:

Number of CARET Lung Cancer Primaries by Stage of Disease
Number of CARET Prostate Cancer Primaries by Stage of Disease
Number of CARET Breast Cancer Primaries by Stage of Disease
Number of CARET Colorectal Cancer Primaries by Stage of Disease

Data Collection Forms and Contact Schedule

   Contact schedule1
Χ = Collecton was at all time points unless otherwise noted
 CV = Clinic Visit / PC = Phone Contact / MQ = Mailed Questionnaire 
    Post-intervention phases
  Data Collections Forms Baseline
 (1985-1994) 
 Intervention Phase 
 (1985-1995) 
 Transition
  (1996-1997) 
 Follow-Up
 (1997-2005) 
   CV (x1) CV (xl/year)
 PC (x2/year)
 CV (x1)
 PC (x2)
 PC or MQ (1x/year) 
Cancer and death endpoints Confirmation of Cancer Endpoint 
 Confirmation of Death Endpoint
N/AΧΧΧ
Blood specimens Blood Collection and ProcessingΧΧ
(CVs, every two years only)
Χ
(CV only)
 
Dietary intake4 Food Frequency Questionnaire ΧΧ
(CVs, every two years only)
Χ
(CV only)
 
Smoking history Questionnaire #1 (baseline, Q.1-Q.7)
 Health Questionnaire #4 (follow-up, Q27) 
 ΧΧΧΧ
Supplemental vitamin use Contact Summary (Q.89) ΧΧ
(CVS only)
Χ 
Health history Health Questionnaire #4ΧΧ
(CVs only)
Χ 
Family history of cancer Questionnaire #1 (Q.9-Q.11)Χ Χ 
Respiratory history Respiratory History QuestionnaireΧΧ
(CVs, every two years only)
Χ
(CV only)
 
Spirometry
(asbestos workers only)
 SpirometryΧΧ
(CVs every two years there after)
Χ
(CV only)
 
X-ray
(asbestos workers only)
 Study Center X-ray ReviewΧ   
Work history Questionnaire #1 (Q.13-Q.17)Χ   
Demographics
(age, reace, sex, marital status3
and education level) 
 Questionnaire #1 (Q.18-Q.21)
 Contact Information Update
Χ   
Height, weight, blood pressure Contact Summary (Q.59-Q.61)ΧΧ
(CVs only)
Χ
(CV only)
 

1 Reflects contact schedule for most participants; participants enrolled during the pilot phase were contacted more frequently than those enrolled during the full-scale trials. Contact schedule specifics can be found here.
2 Examples of commonly used data collection forms are provided. Forms were modified over the course of the study, but data items within the questionnaires were generally consistent, if not identical.
3 Marital status was collected at all contacts
4 The CARET Food Frequency Questionnaire (FFQ) was designed to be especially sensitive to the measurement of fruits and vegetables and their nutrients, such as carotenoids. This FFQ is divided into three sections: (1) seven adjustment questions on types of foods and preparation techniques (e.g., usual types of fat used in cooking and at the table), which are used to alter how the analysis software calculates the fat and fiber content of specific foods; (2) 110 line items with questions on the frequency of use over the last year (from “never or less than once a month” to “2+ per day” for foods and “6+ per day” for beverages) and portion size (small, medium, or large, compared to the stated medium portion size); and (3) two summary questions on the usual consumption of fruits and vegetables, which are necessary to reduce the measurement error biased toward overreporting food intake. The nutrient database is derived from the University of Minnesota Nutrition Coordinating Center (NCC) database and included the 1998 USDA-NCC carotenoid database for U.S. foods.


Click here to download the latest version of Adobe Reader.