DNA to protein translation tool

Start using this tool

How to use custom genetic codes

The genetic code used to translate a sequence into protein may be customized.

This service allows introducing the genetic code as a string, where each character corresponds to one aminoacid and asteriscs represents termination codes. In the example bellow is shown the standard genetic code and the corresponding triplets.

Standard genetic code

Aminoacid/Termination FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG

-- Base1 TTTTTTTTTTTTTTTTCCCCCCCCCCCCCCCCAAAAAAAAAAAAAAAAGGGGGGGGGGGGGGGG
-- Base2 TTTTCCCCAAAAGGGGTTTTCCCCAAAAGGGGTTTTCCCCAAAAGGGGTTTTCCCCAAAAGGGG
-- Base3 TCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAG


Explanation

In the first line, the first character ("F") represents Phenylalanine,which is encoded by the triplet TTT (first character of "Base1",
first character of "Base2" and first character of "Base3")

The eleventh character ("*") represents a termination code, which is encoded by the triplet TAA.



The custom genetic code provided must be 64 characters long. Correspondence between characters and aminoacids may follow the system used in this service or may be different, but it will be always case insentitive.

Methionine as a initiation code

When searching "ORFs trimmed to MET-to-Stop", they will be shown the longest ORFs available (from methionine to Stop), so that within the ORF  there may be several methionines, as for example in the aminoacid secuence bellow:

MQVVLITLSDVNSTTWGSRISLGYMAACFRVREVELVKNLMMTGVVLQFTVDFPPSNSEFPHMLGNSNTISPFIPISAT



1-letter aminoacid codes

    A  alanine                         P  proline
B aspartate or asparagine Q glutamine
C cysteine R arginine
D aspartate S serine
E glutamate T threonine
F phenylalanine U selenocysteine
G glycine V valine
H histidine W tryptophan
I isoleucine Y tyrosine
K lysine Z glutamate or glutamine
L leucine X any
M methionine * translation stop
N asparagine - gap of indeterminate length