
|
GPCRHMM
A GPCR detection method
|
Instructions
This method predicts G protein-coupled receptors from amino acid
sequence.
GPCRHMM is described in
Markus Wistrand*, Lukas Käll* and Erik L.L. Sonnhammer.
A general model of G protein-coupled receptor sequences and its application to detect remote homologs.
Protein Science, 15 (3):509-21, Mars 2006.
*These authors contributed equally to this work.
Instalation
To install the method you need to:
- Download gpcrhmm package frome here
- untar it in a suitable directory using the command
tar xvzf gpcrhmm.tar.gz.
- Obtain a copy of phobius.
- Copy the executable
decodeanhmm from the phobius directory to the gpcrhmm directory.
- Make a link from
gpcrhmm.pl to your standard path e.g /usr/local/bin.
Input
The program gpcrhmm.pl takes proteins in FASTA format. It recognizes the 20
amino acids and B, Z, and X, which are all treated equally as
unknown. Any other character is changed to X, so please make sure the
sequences are sensible proteins.
This is an example (one protein):
>EBI2_HUMAN P32249 [Homo sapiens (human)] EBV-induced G protein-coupled receptor 2 (EBI2).
MDIQMANNFTPPSATPQGNDCDLYAHHSTARIVMPLHYSLVFIIGLVGNLLALVVIVQNR
KKINSTTLYSTNLVISDILFTTALPTRIAYYAMGFDWRIGDALCRITALVFYINTYAGVN
FMTCLSIDRFIAVVHPLRYNKIKRIEHAKGVCIFVWILVFAQTLPLLINPMSKQEAERIT
CMEYPNFEETKSLPWILLGACFIGYVLPLIIILICYSQICCKLFRTAKQNPLTEKSGVNK
KALNTIILIIVVFVLCFTPYHVAIIQHMIKKLRFSNFLECSQRHSFQISLHFTVCLMNFN
CCMDPFIYFFACKGYKRKVMRMLKRQVSVSISSAVKSAPEENSREMTETQMMIHSKSSNG
K
GPCR detection
The usage of the method is to detect putative GPCRs in from amino acid sequence.
Either give the name of the local file in which you have the proteins
in the, or paste the sequence(s) into the input form.
If you paste; make sure that the first line of each entry (the identifier) is not broken.
There is a global and a local score (explained below). The local score
is only calculated for the proteins that have a global score higher
than a threshold of 0. To only use global scoring is faster,
but tend to accumulate false positives.
Global/Local score
GPCRHMM is based on a hidden Markov model that mimics the common topology of
GPCRs: they all span the membrane sevenfold. Global scoring takes
the sequence through the entire model, including the N- and C-terminal
sections. The reported score is calculated using the forward algorithm.
Local scoring takes the subsequence that spans the 1st-7th
TM-helix (as predicted by the 1-best algorithm using the global model). The subsequence is scored to a core
model that only have compartments corresponding to the 1st-7th
TM-helix (forward algorithm). The local score can detect false positives that don't fit the model
well, but which have long N- or C-terminal regions that accumulate
score because of biased sequence composition.
Output
Output is global score, local score and a prediction (GPCR/no).
All reported scores are log-odds scores related to a null model. The
null model reflects the amino acid composition in SwissProt.