STR Profile Database

Searchable STR Database for Human Cell Lines

As part of our continuing efforts to characterize and authenticate the cell lines in the Cell Biology collection, ATCC is developing a comprehensive database of short tandem repeat (STR) DNA profiles for all of our human cell lines.

Background

Short tandem repeat (STR) loci are among the most informative polymorphic markers in the human genome. Studies have shown that a minimum of eight STR markers are required to positively identify human cell lines. Use of 8 core STR loci enables a 1 in 108 discrimination rate for unrelated individuals.1 ATCC generates human STR profiles by simultaneously amplifying eight STR loci (D5S818, D13S317, D7S820, D16S539, vWA, TH01, TPOX, CSF1PO) and amelogenin (for gender determination) using the Promega PowerPlex® 1.2 system. Amplicons are separated by capillary electrophoresis and analyzed using Genemapper® ID 3.2.1 software from Applied Biosystems. Each relevant peak in the resulting electropherogram represents an allele which is alphanumerically scored and entered into the database.

As part of our quality control procedures and commitment to cell line authentication, we have developed a comprehensive database of STR DNA profiles of all ATCC human cell lines. STR profiles help ensure the quality and integrity of human cell lines in the scientific community. After determining the STR profiles of your human cell lines, you may compare them to the human cell lines in the ATCC STR database.
Recent enhancements include:

  • Updated algorithms providing more specific results
  • Search results that can be sorted by category
  • Excel-exportable search results

If you have questions about STR profiles or this database, please contact a technical service representative. ATCC encourages citations and/or references to this database and the data contained therein may be cited in publications.

As in the past, when we find a misidentified cell line among our holdings (i.e., the DNA profile is similar or identical to that of an unrelated cell line), we will post a note on the Misidentified Cell Lines page of our website.

How to use the database

    1. Log in to the ATCC website to access the STR database query form.
    2. For each query, enter either (1) an ATCC catalog number, OR (2) at least 7 of the 8 STR loci.
      • Separate each allele entry with a comma (e.g., CSF1PO = 11, 12); a space after the comma is not required.
      • Blank entries at any loci will be treated as null values.
      • The amelogenin gene is a genetic marker used for gender determination and is not comprised of STR units. Leaving the amelogenin field blank will not affect the results, since the algorithm is based on STR statistics independent of gender.
    3. Click “Submit”.

To access the search form you are required to login. Please click here to go to the login page and use the E-mail Address and Password used when you created your profile. Don't have a profile? Create One

How to interpret results

    1. The “ATCC Number” field is given precedence if data is entered into both the “ATCC Number” field and the STR loci field.
    2. Results are presented in descending order based on the highest percent match to the query.
    3. Results will only be returned when at least 7 out of 8 loci match the query.
      •  A cell line is considered to be “identical” to a culture in the ATCC STR database when the entered STR profile yields a 100% match (all 8 of the 8 loci) to the result set.
      •  A cell line is considered to have a “similar profile” when the entered STR profile yields a result set that matches only 7 of the 8 STR loci.
    4. If a cell line STR profile varies by 2 or more loci, no results will be retrieved (e.g., LOH has occurred in a derivative cell line at two loci) a message will be displayed reading, “No Matching Records Found!”
      •  In such cases, the likelihood that the query represents the profile of an unrelated cell line, not matching any cultures currently listed in the ATCC STR database, is very high.

 

Reference:
1. Lins, A.M., et al. J. Forensic Sci. 43: 1168-1180, 1998.

Disclaimer:
Reference to this database and the data contained therein may be cited in publications, and ATCC encourages such citation or reference. While every reasonable effort has been made to assure the accuracy of these data, no warranty, express or implied, is made by ATCC as to their accuracy.

While ATCC has used the Promega PowerPlex Version 1.2 product in the creation of these data and recommends that researchers wishing to produce data for comparison also use this product, ATCC does not provide a general endorsement of this product or provide any warranty or representation regarding its quality or performance in the scientific community for the identification of human cell lines.

PowerPlex is a registered trademark of the Promega Corporation.
Genemapper ID is a registered trademark of Applied Biosystems.