Monarch geneset OGS2.0

DPOGS205008
TranscriptDPOGS205008-TA972 bp
ProteinDPOGS205008-PA323 aa
Genomic positionDPSCF300123 + 292989-293960
RNAseq coverage209x (Rank: top 46%)
Annotation
HeliconiusHMEL0094618e-15076.78% 
BombyxBGIBMGA010236-TA3e-14876.23% 
DrosophilaCG6415-PA5e-8950.15% 
EBI UniRef50UniRef50_E1ZYN22e-9352.94%Aminomethyltransferase n=4 Tax=Formicidae RepID=E1ZYN2_CAMFO
NCBI RefSeqXP_001660716.11e-9955.25%aminomethyltransferase [Aedes aegypti]
NCBI nr blastpgi|1571256172e-9855.25%aminomethyltransferase [Aedes aegypti]
NCBI nr blastxgi|1571256172e-9355.25%aminomethyltransferase [Aedes aegypti]
Group
Gene OntologyGO:00065461.4e-84glycine catabolic process
GO:00040471.4e-84aminomethyltransferase activity
GO:00057373.5e-60cytoplasm
KEGG pathwayaag:AaeL_AAEL0102764e-99 
 K00605 (E2.1.2.10, gcvT)maps-> Nitrogen metabolism
    One carbon pool by folate
    Glycine, serine and threonine metabolism
InterPro domain[1-316] IPR0062231.4e-84Glycine cleavage system T protein
[2-209] IPR0062223.5e-60Glycine cleavage T-protein, N-terminal
[220-312] IPR0139775.4e-24Glycine cleavage T-protein, C-terminal barrel
Orthology groupMCL13943 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205008-TA
ATGTTACAAACTAATGTGTACGGTAAGGATTGCGTATCTTGGTTTGAATCGATATGCCCAGTCGACTTAAAGGGAATGGCACATGGTACCAGCTCATTGACTGTGTTCTTAAATAAAGACGGTGGAATTATAGATGACCTTATAATTACCAAAGTCAAAGAAGACCAACTCTACATAGTTTCAAATGCCGGTCGTTTAGAAGTCGACACACAACACATGCTCGAAACTTCGGAATTGTATCGTAAGAAAGGCAAAGATGTAAAAGTTAGTATGTGGGATGTGACACAAAGAGCCTTAATAGCTGTGCAAGGGCCGAAGGCAGCGGCTGCTGTGCAAGCTATTACAAATTTACAATTAGAGGAACTAACATTTATGACATCCCGGATTGGTTTAGTTGCGGGGGTTGAATGTAGAGTAACCCGCTGTGGTTACACCGGTGAAGATGGTGTAGAGATTTCCATTCCAGAAGATAAAGCCGTTCAAGTAACTGAGGCTTTATTACAGTGTAAAGATGTGAAACTCGCTGGTCTGGGAGCTAGAGATTCCCTCCGTTTAGAGGCTGGATTATGTTTGTATGGAAACGATATCGATGAAACTGTTACTCCTGTTGAAGCTGCTTTGACTTGGTTAATATCTAAGAACCGTCGGCAAAGCGCTGCTTTCCCCGGAGCCGACATTATTTTACGTCAAATCAAAGATGGCGTAAGCAAAAGAAGGGTTGGTTTAAGAATGGTGGAGGGAGCACCGGCACGTAAGGACGCGCTTTTGAAAGATGCTACCGGAAATGTCATTGGTAAAGTGACAAGCGGCTGTCCAAGTCCCTCGCTTGGCGGGAACGTAGCAATGGGATACGTAAAGGAAGAATTTAAAAAAGTCGGAAACGAATTACTTGTAAACATCCGAGGAAAAGATGTGGCTTGCAAAGTCGCGAAAATGCCTTTCGTCCCCTCAAAATATTACATAAAGAAATAA

Protein sequence:

>DPOGS205008-PA
MLQTNVYGKDCVSWFESICPVDLKGMAHGTSSLTVFLNKDGGIIDDLIITKVKEDQLYIVSNAGRLEVDTQHMLETSELYRKKGKDVKVSMWDVTQRALIAVQGPKAAAAVQAITNLQLEELTFMTSRIGLVAGVECRVTRCGYTGEDGVEISIPEDKAVQVTEALLQCKDVKLAGLGARDSLRLEAGLCLYGNDIDETVTPVEAALTWLISKNRRQSAAFPGADIILRQIKDGVSKRRVGLRMVEGAPARKDALLKDATGNVIGKVTSGCPSPSLGGNVAMGYVKEEFKKVGNELLVNIRGKDVACKVAKMPFVPSKYYIKK-