Posts Relationship Between Biological Assembly and Asymmetric Unit in mmCIF Format Files
Post
Cancel

Relationship Between Biological Assembly and Asymmetric Unit in mmCIF Format Files

Key Concepts

Asymmetric Unit

The asymmetric unit is the smallest portion of a crystal structure to which symmetry operations can be applied in order to generate the complete unit cell (the crystal repeating unit). Symmetry operations most common to crystals of biological macromolecules are rotations, translations and screw axes (combinations of rotation and translation).A crystal asymmetric unit may contain:[1]

  • one biological assembly
  • a portion of a biological assembly
  • multiple biological assemblies

Biological Assembly

The biological assembly (also sometimes referred to as the biological unit) is the macromolecular assembly that has either been shown to be or is believed to be the functional form of the molecule. For example, the functional form of hemoglobin has four chains.

Depending on the particular crystal structure, symmetry operations consisting of rotations, translations or their combinations may need to be performed in order to obtain the complete biological assembly. Alternately, a subset of the deposited coordinates may need to be selected to represent the biological assembly. Thus, a biological assembly may be built from: [1]

  • one copy of the asymmetric unit
  • multiple copies of the asymmetric unit
  • a portion of the asymmetric unit
KeyCategory Description
_entity.idOne concept on which the PDBx/mmCIF format relies is that of entities. An entity is a chemically distinct part of a structure as represented in the PDBx/mmCIF data file. Data items in the _entity category, describe the chemistry and identity of the molecules under investigation. In any particular entry, there may be multiple copies of a given entity.[2] More Details
_entity_poly.entity_id, _entity_poly.pdbx_strand_id, _entity_poly.typeData items in the ENTITY_POLY category record details about the polymer, such as the type of the polymer.[3]
_pdbx_struct_assembly_gen.assembly_id, _pdbx_struct_assembly_gen.oper_expression, _pdbx_struct_assembly_gen.asym_id_listData items in the PDBX_STRUCT_ASSEMBLY_GEN category record details about the generation of each macromolecular assemblies. The PDBX_STRUCT_ASSEMBLY_GEN data items provide the specifications of the components that constitute that assembly in terms of cartesian transformations.[3]
_pdbx_struct_assembly.oligomeric_countData items in the PDBX_STRUCT_ASSEMBLY category record details about the structural elements that form macromolecular assemblies.[3]

Explanation

The hierarchical frameworks of asym_ids (fig 1) The distribution of asym_ids in biological assemblies (fig 2)

Examples

One copy of the asymmetric unit

2ZQT

The asymmetric unit of PDB ID 2zqt The biological assembly of PDB ID 2zqt
1
2
3
4
5
6
7
8
9
pdb_id                                                                                    2zqt
_entity.id                                                                        [1, 2, 3, 4]
_entity_poly.entity_id                                                                     [1]
_entity_poly.pdbx_strand_id                                                                [A]
_entity_poly.type                                                             [polypeptide(L)]
_pdbx_struct_assembly_gen.assembly_id                                                      [1]
_pdbx_struct_assembly_gen.oper_expression                                                  [1]
_pdbx_struct_assembly_gen.asym_id_list                                               [A,B,C,D]
_pdbx_struct_assembly.oligomeric_count                                                     [1]

_pdbx_struct_assembly_gen.asym_id_list $\Rightarrow$ [A,B,C,D]


6IWG

The asymmetric unit of PDB ID 6iwg The biological assembly of PDB ID 6iwg
1
2
3
4
5
6
7
8
9
pdb_id                                                                                    6iwg
_entity.id                                                               [1, 2, 3, 4, 5, 6, 7]
_entity_poly.entity_id                                                               [1, 2, 3]
_entity_poly.pdbx_strand_id                                                          [A, B, C]
_entity_poly.type                             [polypeptide(L), polypeptide(L), polypeptide(L)]
_pdbx_struct_assembly_gen.assembly_id                                                      [1]
_pdbx_struct_assembly_gen.oper_expression                                                  [1]
_pdbx_struct_assembly_gen.asym_id_list       [A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W...
_pdbx_struct_assembly.oligomeric_count                                                     [3]

_pdbx_struct_assembly_gen.asym_id_list $\Rightarrow$ [A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V,W


Multiple copies of the asymmetric unit

1HHO

The asymmetric unit of PDB ID 1hho The biological assembly of PDB ID 1hho
1
2
3
4
5
6
7
8
9
pdb_id                                                                                    1hho
_entity.id                                                                  [1, 2, 3, 4, 5, 6]
_entity_poly.entity_id                                                                  [1, 2]
_entity_poly.pdbx_strand_id                                                             [A, B]
_entity_poly.type                                             [polypeptide(L), polypeptide(L)]
_pdbx_struct_assembly_gen.assembly_id                                                      [1]
_pdbx_struct_assembly_gen.oper_expression                                                [1,2]
_pdbx_struct_assembly_gen.asym_id_list                                     [A,B,C,D,E,F,G,H,I]
_pdbx_struct_assembly.oligomeric_count                                                     [4]

_pdbx_struct_assembly_gen.asym_id_list $\Rightarrow$ [A,B,C,D,E,F,G,H,I] $* 2$


2AKE

The asymmetric unit of PDB ID 2ake The biological assembly of PDB ID 2ake
1
2
3
4
5
6
7
8
9
pdb_id                                                                                    2ake
_entity.id                                                                        [1, 2, 3, 4]
_entity_poly.entity_id                                                                  [1, 2]
_entity_poly.pdbx_strand_id                                                             [B, A]
_entity_poly.type                                         [polyribonucleotide, polypeptide(L)]
_pdbx_struct_assembly_gen.assembly_id                                                      [1]
_pdbx_struct_assembly_gen.oper_expression                                                [1,2]
_pdbx_struct_assembly_gen.asym_id_list                                               [A,B,C,D]
_pdbx_struct_assembly.oligomeric_count                                                     [4]

_pdbx_struct_assembly_gen.asym_id_list $\Rightarrow$ [A,B,C,D] $* 2$


A portion of the asymmetric unit

2XYN

The asymmetric unit of PDB ID 2xyn The biological assembly 1 of PDB ID 2xyn The biological assembly 2 of PDB ID 2xyn The biological assembly 3 of PDB ID 2xyn
1
2
3
4
5
6
7
8
9
pdb_id                                                                                    2xyn
_entity.id                                                                     [1, 2, 3, 4, 5]
_entity_poly.entity_id                                                                     [1]
_entity_poly.pdbx_strand_id                                                            [A,B,C]
_entity_poly.type                                                             [polypeptide(L)]
_pdbx_struct_assembly_gen.assembly_id                                                [1, 2, 3]
_pdbx_struct_assembly_gen.oper_expression                                            [1, 1, 1]
_pdbx_struct_assembly_gen.asym_id_list                       [A,D,E,F,G,N, B,H,I,J,O, C,K,L,M]
_pdbx_struct_assembly.oligomeric_count                                               [1, 1, 1]

_pdbx_struct_assembly_gen.asym_id_list $\Rightarrow$ [A,D,E,F,G,N, B,H,I,J,O, C,K,L,M]


3A6P

The asymmetric unit of PDB ID 3a6p The biological assembly 1 of PDB ID 3a6p The biological assembly 2 of PDB ID 3a6p
1
2
3
4
5
6
7
8
9
pdb_id                                                                                    3a6p
_entity.id                                                               [1, 2, 3, 4, 5, 6, 7]
_entity_poly.entity_id                                                         [1, 2, 3, 4, 5]
_entity_poly.pdbx_strand_id                                          [A,F, B,G, C,H, D,I, E,J]
_entity_poly.type                            [polypeptide(L), polypeptide(L), polypeptide(L)...
_pdbx_struct_assembly_gen.assembly_id                                                   [1, 2]
_pdbx_struct_assembly_gen.oper_expression                                               [1, 1]
_pdbx_struct_assembly_gen.asym_id_list                          [A,B,C,D,E,K,L, F,G,H,I,J,M,N]
_pdbx_struct_assembly.oligomeric_count                                                  [5, 5]

_pdbx_struct_assembly_gen.asym_id_list $\Rightarrow$ [A,B,C,D,E,K,L, F,G,H,I,J,M,N]


Special

a portion of the asymmetric unit, (a portion of the asymmetric unit + multiple copies of the asymmetric unit)

1DFV

The asymmetric unit of PDB ID 1dfv The biological assembly 1 of PDB ID 1dfv The biological assembly 2 of PDB ID 1dfv The biological assembly 3 of PDB ID 1dfv
1
2
3
4
5
6
7
8
9
pdb_id                                                                                    1dfv
_entity.id                                                                     [1, 2, 3, 4, 5]
_entity_poly.entity_id                                                                     [1]
_entity_poly.pdbx_strand_id                                                              [A,B]
_entity_poly.type                                                             [polypeptide(L)]
_pdbx_struct_assembly_gen.assembly_id                                                [1, 2, 3]
_pdbx_struct_assembly_gen.oper_expression                                          [1, 1, 1,2]
_pdbx_struct_assembly_gen.asym_id_list                         [A,C,D,E,F,I, B,G,H,J, B,G,H,J]
_pdbx_struct_assembly.oligomeric_count                                               [1, 1, 2]

_pdbx_struct_assembly_gen.asym_id_list $\Rightarrow$ [A,C,D,E,F,I, B,G,H,J, B,G,H,J]

  • BioUnit1: A,C,D,E,F,I, $\rightarrow * 1$
  • BioUnit2: B,G,H,J, $\rightarrow * 1$
  • BioUnit3: B,G,H,J,$\rightarrow * 2$

a portion of the asymmetric unit + multiple copies of the asymmetric unit

3HL2

The asymmetric unit of PDB ID 3hl2 The biological assembly 1 of PDB ID 3hl2 The biological assembly 2 of PDB ID 3hl2
1
2
3
4
5
6
7
8
9
pdb_id                                                                                    3hl2
_entity.id                                                                  [1, 2, 3, 4, 5, 6]
_entity_poly.entity_id                                                                  [1, 2]
_entity_poly.pdbx_strand_id                                                       [A,B,C,D, E]
_entity_poly.type                                         [polypeptide(L), polyribonucleotide]
_pdbx_struct_assembly_gen.assembly_id                                             [1, 1, 2, 2]
_pdbx_struct_assembly_gen.oper_expression                                         [1, 2, 3, 2]
_pdbx_struct_assembly_gen.asym_id_list       [A,B,F,G,H,I,J,K,Q,R, A,B,E,F,G,H,I,J,K,Q,R,U, C,D,L,M,N,O,P,S,T, C,D,E,L,M,N,O,P,S,T,U]
_pdbx_struct_assembly.oligomeric_count                                                  [5, 5]

_pdbx_struct_assembly_gen.asym_id_list $\Rightarrow$ [A,B,F,G,H,I,J,K,Q,R, A,B,E,F,G,H,I,J,K,Q,R,U, C,D,L,M,N,O,P,S,T, C,D,E,L,M,N,O,P,S,T,U]

  • BioUnit1: A,B,F,G,H,I,J,K,Q,R, $+$ A,B,E,F,G,H,I,J,K,Q,R,U,
  • BioUnit2: C,D,L,M,N,O,P,S,T, $+$ C,D,E,L,M,N,O,P,S,T,U

Reference

  1. http://pdb101.rcsb.org/learn/guide-to-understanding-pdb-data/biological-assemblies#Anchor-BioUnit
  2. https://pdb101.rcsb.org/learn/guide-to-understanding-pdb-data/beginner%E2%80%99s-guide-to-pdb-structures-and-the-pdbx-mmcif-format
  3. http://mmcif.rcsb.org/pdbx-mmcif-home-page.html
  4. http://proteopedia.org/wiki/index.php/Biological_Unit
This post is licensed under CC BY 4.0 by the author.