Data Item _entity_poly.pdbx_seq_one_letter_code_can

General

Item name
_entity_poly.pdbx_seq_one_letter_code_can
Category name
entity_poly
Attribute name
pdbx_seq_one_letter_code_can
Required in PDB entries
no
Used in current PDB entries
Yes, in about 100.0 % of entries

Item Description

Canonical sequence of protein or nucleic acid polymer in standard one-letter codes of amino acids or nucleotides, corresponding to the sequence in _entity_poly.pdbx_seq_one_letter_code. Non-standard amino acids/nucleotides are represented by the codes of their parents if parent is specified in _chem_comp.mon_nstd_parent_comp_id, or by letter 'X' if parent is not specified. Deoxynucleotides are represented by their canonical one-letter codes of A, C, G, or T. For modifications with several parent amino acids, all corresponding parent amino acid codes will be listed (ex. chromophores).

Item Example

 
MSHHWGYGKHNGPEHWHKDFPIAKGERQSPVDIDTHTAKYDPSLKPLSVSYDQATSLRILNNGAAFNVEFD

Data Type

Data type code
text
Data type detail
text item types / multi-line text ...
Primitive data type code
char
Regular expression
[][ \n\t()_,.;:"&<>/\{}'`~!@#$%?+=*A-Za-z0-9|^-]*

Aliases

Alias Item Name Dictionary Name Dictionary Version
_entity_poly.ndb_seq_one_letter_code_can cif_rcsb.dic 1.1