Welcome to Allermatch.org™

6.1. Sequence Homology as Derived from Allergen Databases

The commonly used protein databases (PIR, SwissProt and TrEMBL) contain the amino acid sequences of most allergens for which this information is known. However, these databases are currently not fully up to date. A specialized allergen database is under construction.

Suggested procedure for determining the percent amino acid identity between the expressed protein and known allergens:

Step 1: obtain the amino acid sequences of all allergens in the protein databases (for SwissProt and TrEMBL: see http://expasy.ch/tools; for PIR see http://wwwnbrf.georgetown.edu/pirwww ) in FASTA format, using the amino acids from the mature proteins only and disregarding the leader sequences, if any. Let this be data set (1).
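As an illustration of Step 1, the sketch below reads FASTA-formatted text into a dictionary and removes a leader sequence of known length. It is a minimal example, not the procedure's prescribed tooling: the function names `read_fasta` and `strip_leader` are hypothetical, and the leader length is assumed to be known in advance (for example from the database annotation).

```python
from typing import Dict, Iterable


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA-formatted lines into {identifier: sequence}."""
    records: Dict[str, str] = {}
    current = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # Header line: use the first whitespace-delimited token as identifier.
            tokens = line[1:].split()
            current = tokens[0] if tokens else f"record_{len(records) + 1}"
            records[current] = ""
        elif current is not None:
            # Sequence line: append to the current record, normalised to upper case.
            records[current] += line.upper()
    return records


def strip_leader(sequence: str, leader_length: int) -> str:
    """Drop a leader peptide of known length (an assumption), keeping the mature protein."""
    return sequence[leader_length:]
```

In practice the lines would come from a downloaded file, for example `read_fasta(open("allergens.fasta"))`, where the file name is a placeholder.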

Step 2: prepare a complete set of 80-amino acid length sequences derived from the expressed protein (again disregarding the leader sequence, if any). Let this be data set (2).
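A minimal sketch of Step 2 follows. It assumes that "a complete set" means every window obtained by sliding one residue at a time, and that a protein shorter than 80 residues is kept as a single window; neither detail is spelled out in the text above, and the name `sliding_windows` is hypothetical.

```python
from typing import List


def sliding_windows(sequence: str, window: int = 80) -> List[str]:
    """Return every contiguous stretch of `window` residues, stepping by one."""
    if not sequence:
        return []
    if len(sequence) <= window:
        # Assumption: a short protein is compared as a whole.
        return [sequence]
    return [sequence[i:i + window] for i in range(len(sequence) - window + 1)]
```

Under these assumptions, a mature protein of N residues (N at least 80) yields N - 80 + 1 windows, so a 100-residue protein gives 21 sequences in data set (2).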

Step 3: go to the EMBL internet address (http://www2.ebi.ac.uk) and compare each sequence of data set (2) with all sequences of data set (1), using the FASTA program on the web site for alignment with the default settings for gap penalty and width.

Cross-reactivity between the expressed protein and a known allergen (as can be found in the protein databases) has to be considered when there is:

1) more than 35 % identity in the amino acid sequence of the expressed protein (i.e. without the leader sequence, if any), using a window of 80 amino acids and a suitable gap penalty (using Clustal-type alignment programs or equivalent alignment programs)

or:

2) identity of 6 contiguous amino acids.
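Criterion 2 can be checked without an alignment program, because it only asks for exact matches. The sketch below is one possible way to do this, using a set of all 6-residue stretches of the allergen; the function name `shared_kmers` and the example sequences are hypothetical.

```python
from typing import List, Tuple


def shared_kmers(query: str, allergen: str, k: int = 6) -> List[Tuple[int, str]]:
    """List (position in query, stretch) for every k-residue stretch also present in allergen."""
    # Collect every contiguous stretch of k residues found in the allergen.
    allergen_kmers = {allergen[i:i + k] for i in range(len(allergen) - k + 1)}
    hits = []
    for i in range(len(query) - k + 1):
        stretch = query[i:i + k]
        if stretch in allergen_kmers:
            hits.append((i, stretch))
    return hits


# Made-up sequences, for illustration only.
hits = shared_kmers("MKTAYIAKQRQISFVK", "GGAYIAKQRGG")
```

Positions are 0-based offsets into the query. A non-empty result means that the expressed protein and the allergen share at least one stretch of 6 identical contiguous amino acids.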

If any of the identity scores equals or exceeds 35 %, this is considered to indicate significant homology within the context of this assessment approach. The use of amino acid sequence homologies to identify prospective cross-reacting allergens in genetically modified foods has been discussed in more detail elsewhere (Gendel, 1998a; Gendel, 1998b).
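To make the 35 % figure concrete: with an 80-amino acid window, 28 identical residues correspond to exactly 35 % identity. The sketch below computes the percentage from two already aligned sequences of equal length, with `-` marking gaps. It assumes the window length (80) is the denominator, which the text does not state explicitly, and it applies the "equals or exceeds" wording of the paragraph above; criterion 1 is phrased as "more than 35 %", so the comparison is left as a parameter. All names are hypothetical.

```python
def percent_identity(aligned_query: str, aligned_allergen: str, window: int = 80) -> float:
    """Percent identity of two aligned sequences, relative to the window length."""
    if len(aligned_query) != len(aligned_allergen):
        raise ValueError("aligned sequences must have the same length")
    # Count aligned positions where both sequences carry the same residue (gaps never count).
    identical = sum(
        1 for a, b in zip(aligned_query, aligned_allergen) if a == b and a != "-"
    )
    return 100.0 * identical / window


def meets_threshold(percent: float, threshold: float = 35.0, inclusive: bool = True) -> bool:
    """Apply the identity threshold; `inclusive` selects 'equals or exceeds' vs 'more than'."""
    return percent >= threshold if inclusive else percent > threshold


# Made-up aligned window: 28 identical positions out of 80.
score = percent_identity("A" * 28 + "C" * 52, "A" * 28 + "D" * 52)
```

Here `score` is 35.0, which meets the threshold under the inclusive reading and fails it under the strict one.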

Important: read the disclaimer below before using this website

All sequences submitted will be treated confidentially

Go to the search page immediately

Contents

Disclaimer

SwissProt copyright statement

Acknowledgement

About this website

Example of matching an input sequence

80 amino acids sliding window

Summary table

Detailed information

Example

Exact hits of small stretches of identical amino acids

Summary table

Detailed information

Example

Full alignment

Example

About us

Feedback

References
