FASTA format description


A sequence in FASTA format begins with a single-line description, followed by lines of sequence data. The description line is distinguished from the sequence data by a greater-than (">") symbol in the first column, a sequence identifier should follow ">" immediately, no space is allowed between ">" and an identifier. Other information can be added following the sequence identifier separated by a space, but all information should be in one line.
>your_sequence_identifier (Note: one line only, it is not allowed to have multiple definition lines) 
ATGGGGAAAat
(sequence can be in either lower or upper case, can be in one line or multiple lines)

Examples for testing the server with corresponding BLASTX output

>gnl|UG|Hs#S1728506
TGAGATCACTTCCCTTGCACAGTTTGGAAGGGAGAGCACTTTATTACAGACCTTGGAAGC
AAGAGGATTGCATTCAGCCTAGTTCCTGGTTGCTGGCCAAAGGGATCATGGACATTGAAG
CATATTTTGAAAGAATTGGCTATAAGAACTCTAGGAACAAATTGGACTTGGAAACATTAA
CTGACATTCTTGAGCACCAGATCCGGGCTGTTCCCTTTGAGAACCTTAACATGCATTGTG
GGCAAGCCATGGAGTTGGGCTTAGAGGCTATTTTTGATCACATTGTAAGAAGAAACCGGG
GTGGGTGGTGTCTCCAGGTCAATCAACTTCTGTACTGGGCTCTGACCACAATCGGTTTTC
AGACCACAATGTTAGGAGGGTATTTTTACATCCCTCCAGTTAACAAATACAGCACTGGCA
TGGTTCACCTTCTCCTGCAGGTGACCATTGACGGCAGGAATTACATTGTCGATGCTGGGT
CTGGAAGCTCCTCCCAGATGTGGCAGCCTCTAGAATTAATTTCTGGGAAGGATCAGCCTC
AGGTGCCTTGCATTTTCTGCTTGACAGAAGAGAGAGGAATCTGGTACCTGGACCAAATCA
GGAGAGAGCAGTATATTACAAACAAAGAATTTCTTAATTCTCATCTCCTGCCAAAGAAGA
AACACCAAAAAATATACTTATTTACGCTTGAACCTCGAACAATTGAAGATTTTGAGTCTA
TGAATACATACCTGCAGACGTCTCCAACATCTTCATTTATAACCACATCATTTTGTTCCT
TGCAGACCCCAGAAGGGGTTTACTGTTTGGTGGGCTTCATCCTCACCTATAGAAAATTCA
ATTATAAAGACAATACAGATCTGGTCGAGTTTAAAACTCTCACTGAGGAAGAGGTTGAAG
AAGTGCTGAAAAATATATTTAAGATTTCCTTGGGGAGAAATCTCGTGCCCAAACCTGGTG
ATGGATCCCTTACTATTTAGAATAAGGAACAAAATAAACCCTTGTGTATGTATCACCCAA
CTCACTAATTATCAACTTATGTGCTATCAGATATCCTCTCTACCCTCACGTTATTTTGAA
GAAAATCCTAAACATCAAATACTTTCATCCATAAAAATGTCAGCATTTATTAAAAAACAA
TAACTTTTTAAAGAAACATAAGGACACATTTTCAAATTAATAAAAATAAAGGCATTTTAA
GGATGGCCTGTGATTATCTTGGGAAGCAGAGTGATTCATGCTAGAAAACATTTAATATTG
ATTTATTGTTGAATTC
>gnl|UG|Hs#S1726223
ATGCACTCAAGCAGAGAAGAAATCCACAAAGACTCACAGTCTGCTGGTGGGCAGAGAAGA
CAGAAACGACATGAGCACAGCAGGAAAAGTAATGAAATGCAAAGCAGCTGTGCTATGGGA
GGTAAAGAAACCCTTTTCCATTGAGGATGTGGAGGTTGCACCTCCTAAGGCTTATGAAGT
TCGCATTAAGATGGTGGCTGTAGGAATCTGTCGCACAGATGACCACGTGGTTAGTGGCAA
ACTGGTGACCCCCCTTCCTGTGATTTTAGGCCATGAGGCAGCCGGCATCGTGGAGAGTGT
TGGAGAAGGGGTGACTACAGTCAAACCAGGTGATAAAGTCATCCCGCTCTTTACTCCTCA
GTGTGGAAAATGCAGAGTTTGTAAAAACCCGGAGAGCAACTACTGCTTGAAAAATGATCT
AGGCAATCCCCGGGGGACCCTGCAGGATGGCACCAGGAGGTTCACCTGCAGGGGGAAGCC
CATTCACCACTTCCTTGGCACCAGCACCTTCTCCCAGTACACGGTGGTGGATGAGAATGC
AGTGGCCAAAATTGATGCAGCCTCGAAACTGGAGAAAGTCTGCCTCATTGGCTGTGGATT
CTCGACTGGTTATGGGTCTGCAGTTAACGTTGCCAAGGTCACCCCAGGCTCTACCTGTGC
TGTGTTTGGCCTGGGAGGGGTCGGCCTATCTGCTGTTATGGGCTGTAAAGCAGCTGGAGC
AGCCAGAATCATTGCGGTGGACATCAACAAGGACAAATTTGCAAAGGCCAAAGTGTTGGG
TGCCACTGAATGCATCAACCCTCAAGACTACAAGAAACCCATCCAGGAAGTGCTAAAGGA
AATGACTGATGGAGGTGTGGATTTTTCGTTTGAAGTCATCGGTCGGCTTGACACCATGAT
GGCTTCCCTGTTATGTTGTCATGAGGCATGTGGCACAAGCGTCATCGTAGGGGTACCTCC
TGCTTCCCAGAACCTCTCAATAAACCCTATGCTGCTACTGACTGGACGCACCTGGAAGGG
GGCTGTTTATGGTGGCTTTAAGAGTAAAGAAGGTATCCCAAAACTTGTGGCTGATTTTAT
GGCTAAGAAGTTTTCACTGGATGCGTTAATAACCCATGTTTTACCTTTTGAAAAAATAAA
TGAAGGATTTGACCTGCTTCACTCTGGGAAAAGTATCCGTACCGTCCTGACGTTTTGAGG
CAATAGAGATGCCTTCCCCTGTAGCAGTCTTCAGCCTCCTCTACCCTACGAGATCTGGAG
CAACAGCTAGGAAATATCATTAATTCAGCTCTTCAGAGATGTTATCAATAAATTACACAT
GGGGGCTTTCCAAAGAAATGGAAATTGATGGGAAATTATTTTTCAGGAAAATTTAAAATT
CAAGTGAGAAGTAAATAAAGTGTTGAACATCAGCTGGGGAATTGAAGCCAACAAACCTTC
CTTCTTAACCATTCTACTGTGTCACCTTTGCCATTGAGGAAAAATATTCCTGTGACTTCT
TGCATTTTTGGTATCTTCATAATCTTTAGTCATCGAATCCCAGTGGAGGGGACCCTTTTA
CTTGCCCTGAACATACACATGCTGGGCCATTGTGATTGAAGTCTTCTAACTCTGTCTCAG
TTTTCACTGTCGACATTTTCCTTTTTCTAATAAAAATGTACCAAATCCCTGGGGTAAAAG
CTAGGGTAAGGTAAAGGATAGACTCACATTTACAAGTAGTGAAGGTCCAAGAGTTCTAAA
TACAGGAAATTTCTTAGGAACTCAAATAAAATGCCCCACATTTTACTACAGTAAATGGCA
GTGTTTTTATGACTTTTATACTATTTCTTTATGGTCGATATACAATTGATTTTTTAAAAT
AATAGCAGATTTCTTGCTTCATATGACAAAGCCTCAATTACTAATTGTAAAAACTGAACT
ATTCCCAGAATCATGTTCAAAAAATCTGTAATTTTTGCTGATCGAAAGTGCTTCATTGAC
TAAACAGTATTAGTTTGTGGCTATAAATGATTATTTAGATGATGACTGAAAATGTGTATA
AAGTAATTAAAAGTAATATGGTGGCTTTAAGTGTAGAGATGGGATGGCAAATGCTGTGAA
TGCAGAATGTAAAATTGGTAACTAAGAAATGGCACAAACACCTTAAGCAATATATTTTCC
TAGTAGATATATATATACACATACATATATACACATATACAAATGTATATTTTTGCAAAA
TTGTTTTCAATCTAGAACTTTTCTATTAACTACCATGTCTTAAAATCAAGTCTATAATCC
TAGCATTAGTTTAATATTTTGAATATGTAAACACCTGTGTTAATGCTTTGTTAATGCTTT
TCCCACTCTCATTTGTTAATGCTTTCCCACTCTCGGGAAGGATTTGCATTTTGAGCTTTA
TCTCTAAATGTGACATGCAAAGATTATTCCTGGTAAAGGAGGTAGCTGTCTCCAAAAATG
CTATTGTTGCAATATCTACATTCTATTTCATATTATGAAAGACCTTAGACATAAAGTAAA
ATAGTTTATCATTT
>gnl|UG|Hs#S3880904
CTTACAGAACTCCCACGGACACACCATGATAAGGACGCTGCTGCTGTCCACTTTGGTGGC
TGGAGCCCTCAGTTGTGGGGACCCCACTTACCCACCTTATGTGACTAGGGTGGTTGGCGG
TGAAGAAGCGAGGCCCAACAGCTGGCCCTGGCAGGTCTCCCTGCAGTACAGCTCCAATGG
CAAGTGGTACCACACCTGCGGAGGGTCCCTGATAGCCAACAGCTGGGTCCTGACGGCTGC
CCACTGCATCAGCTCCTCCAGGACCTACCGCGTGGGGCTGGGCCGGCACAACCTCTACGT
TGCGGAGTCCGGCTCGCTGGCAGTCAGTGTCTCTAAGATTGTGGTGCACAAGGACTGGAA
CTCCAACCAAATCTCCAAAGGGAACGACATTGCCCTGCTCAAACTGGCTAACCCCGTCTC
CCTCACCGACAAGATCCAGCTGGCCTGCCTCCCTCCTGCCGGCACCATTCTACCCAACAA
CTACCCCTGCTACGTCACGGGCTGGGGAAGGCTGCAGACCAACGGGGCTGTTCCTGATGT
CCTGCAGCAGGGCCGGTTGCTGGTTGTGGACTATGCCACCTGCTCCAGCTCTGCCTGGTG
GGGCAGCAGCGTGAAAACCAGTATGATCTGTGCTGGGGGTGATGGCGTGATCTCCAGCTG
CAACGGAGACTCTGGCGGGCCACTGAACTGTCAGGCGTCTGACGGCCGGTGGCAGGTGCA
CGGCATCGTCAGCTTCGGGTCTCGCCTCGGCTGCAACTACTACCACAAGCCCTCCGTCTT
CACGCGGGTCTCCAATTACATCGACTGGATCAATTCGGTGATTGCAAATAACTAACCAAA
AGAAGTCCCTGGGACTGTTTCAGACTTGGAAAGGTCACAGAAGGAAAATAATATAATAAA
GTGACAACTATGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAA

Sequences are expected to be represented in the standard nucleic acid codes, with these exceptions: N or n as unknown nucleic acid residue will be translated into X for unknown amino acid residue.
The nucleic acid codes supported are:
        A --> adenosine
        C --> cytidine
        G --> guanine
        T --> thymidine
        N --> A G C T (any)
Symbles for translated amino acids:
    A  alanine                         P  proline
    B  aspartate or asparagine         Q  glutamine
    C  cystine                         R  arginine
    D  aspartate                       S  serine
    E  glutamate                       T  threonine
    F  phenylalanine                   U  selenocysteine
    G  glycine                         V  valine
    H  histidine                       W  tryptophan
    I  isoleucine                      Y  tyrosine
    K  lysine                          Z  glutamate or glutamine
    L  leucine                         X  any
    M  methionine                      *  translation stop
    N  asparagine