Package Bio :: Package SCOP
[hide private]
[frames] | no frames]

Package SCOP

source code

SCOP: Structural Classification of Proteins.

The SCOP database aims to provide a manually constructed classification of
all know protein structures into a hierarchy, the main levels of which
are family, superfamily and fold.

* "SCOP":http://scop.mrc-lmb.cam.ac.uk/scop/

* "Introduction":http://scop.mrc-lmb.cam.ac.uk/scop/intro.html

* "SCOP parsable files":http://scop.mrc-lmb.cam.ac.uk/scop/parse/

The Scop object in this module represents the entire SCOP classification. It
can be built from the three SCOP parsable files, modified is so desired, and
converted back to the same file formats. A single SCOP domain (represented
by the Domain class) can be obtained from Scop using the domain's SCOP
identifier (sid).


nodeCodeDict  -- A mapping between known 2 letter node codes and a longer
                  description. The known node types are 'cl' (class), 'cf'
                  (fold), 'sf' (superfamily), 'fa' (family), 'dm' (domain), 
                  'sp' (species), 'px' (domain). Additional node types may
                  be added in the future.

Submodules [hide private]

Classes [hide private]
  Scop
The entire SCOP hierarchy.
  Node
A node in the Scop hierarchy
  Domain
A SCOP domain.
  Astral
Abstraction of the ASTRAL database, which has sequences for all the SCOP domains, as well as clusterings by percent id or evalue.
Functions [hide private]
 
cmp_sccs(sccs1, sccs2)
Order SCOP concise classification strings (sccs).
source code
 
parse_domain(str)
Convert an ASTRAL header string into a Scop domain.
source code
 
_open_scop_file(scop_dir_path, version, filetype) source code
Variables [hide private]
  nodeCodeDict = {'cf': 'fold', 'cl': 'class', 'dm': 'protein', ...
  _nodetype_to_code = {'class': 'cl', 'domain': 'px', 'family': ...
  nodeCodeOrder = ['ro', 'cl', 'cf', 'sf', 'fa', 'dm', 'sp', 'px']
  astralBibIds = [10, 20, 25, 30, 35, 40, 50, 70, 90, 95, 100]
  astralEvs = [10, 5, 1, 0.5, 0.1, 0.05, 0.01, 0.005, 0.001, 0.0...
  astralEv_to_file = {1e-50: 'e-50', 1e-25: 'e-25', 1e-20: 'e-20...
  astralEv_to_sql = {1e-50: 'e_50', 1e-25: 'e_25', 1e-20: 'e_20'...
  _domain_re = re.compile(r'>?([\w_\.]*)\s+([\w\.]*)\s+\(([^\)]*...
Function Details [hide private]

cmp_sccs(sccs1, sccs2)

source code 

Order SCOP concise classification strings (sccs).

a.4.5.1 < a.4.5.11 < b.1.1.1

A sccs (e.g. a.4.5.11) compactly represents a domain's classification. The letter represents the class, and the numbers are the fold, superfamily, and family, respectively.

parse_domain(str)

source code 

Convert an ASTRAL header string into a Scop domain.

An ASTRAL (http://astral.stanford.edu/) header contains a concise description of a SCOP domain. A very similar format is used when a Domain object is converted into a string. The Domain returned by this method contains most of the SCOP information, but it will not be located within the SCOP hierarchy (i.e. The parent node will be None). The description is composed of the SCOP protein and species descriptions.

A typical ASTRAL header looks like -- >d1tpt_1 a.46.2.1 (1-70) Thymidine phosphorylase {Escherichia coli}


Variables Details [hide private]

nodeCodeDict

Value:
{'cf': 'fold',
 'cl': 'class',
 'dm': 'protein',
 'fa': 'family',
 'px': 'domain',
 'sf': 'superfamily',
 'sp': 'species'}

_nodetype_to_code

Value:
{'class': 'cl',
 'domain': 'px',
 'family': 'fa',
 'fold': 'cf',
 'protein': 'dm',
 'species': 'sp',
 'superfamily': 'sf'}

astralEvs

Value:
[10,
 5,
 1,
 0.5,
 0.1,
 0.05,
 0.01,
 0.005,
...

astralEv_to_file

Value:
{1e-50: 'e-50',
 1e-25: 'e-25',
 1e-20: 'e-20',
 1e-15: 'e-15',
 1e-10: 'e-10',
 1e-05: 'e-5',
 0.0001: 'e-4',
 0.001: 'e-3',
...

astralEv_to_sql

Value:
{1e-50: 'e_50',
 1e-25: 'e_25',
 1e-20: 'e_20',
 1e-15: 'e_15',
 1e-10: 'e_10',
 1e-05: 'e_5',
 0.0001: 'e_4',
 0.001: 'e_3',
...

_domain_re

Value:
re.compile(r'>?([\w_\.]*)\s+([\w\.]*)\s+\(([^\)]*)\) (.*)')