Monarch geneset OGS2.0

DPOGS203960
TranscriptDPOGS203960-TA2532 bp
ProteinDPOGS203960-PA843 aa
Genomic positionDPSCF300005 + 429589-434980
RNAseq coverage636x (Rank: top 20%)
Annotation
HeliconiusHMEL0135140.088.76% 
BombyxBGIBMGA002020-TA0.087.60% 
DrosophilaCG3626-PA0.062.59% 
EBI UniRef50UniRef50_B4NP810.062.17%GK15723 n=15 Tax=Neoptera RepID=B4NP81_DROWI
NCBI RefSeqXP_970207.20.069.63%PREDICTED: similar to nad dehydrogenase [Tribolium castaneum]
NCBI nr blastpgi|2700046900.069.44%hypothetical protein TcasGA2_TC010363 [Tribolium castaneum]
NCBI nr blastxgi|2700046900.069.53%hypothetical protein TcasGA2_TC010363 [Tribolium castaneum]
Group
Gene OntologyGO:00551143.9e-61oxidation-reduction process
GO:00164913.9e-61oxidoreductase activity
GO:00065462.3e-51glycine catabolic process
GO:00040472.3e-51aminomethyltransferase activity
GO:00057372.3e-51cytoplasm
KEGG pathwayreh:H16_B19551e-172 
 K00315 (E1.5.99.2)maps-> Glycine, serine and threonine metabolism
InterPro domain[1-348] IPR0060763.9e-61FAD dependent oxidoreductase
[477-693] IPR0062222.3e-51Glycine cleavage T-protein, N-terminal
[703-806] IPR0139779.3e-17Glycine cleavage T-protein, C-terminal barrel
Orthology groupMCL10485 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203960-TA
ATGGGAGCTGCTGTTGCTTATCATCTTGCAAGAAGAGGATGGGGCCCCCATACTGTAGTTATAGAAAAAGAAAAAGTTGGTGCAGGCAGTAGGTGGCATTCTTCAGGTCTTGTTGGAGCCTTTAAGCCCACTCTTGCTCAAGTTCGTATTGCTCAATCTTCTATAAGACTCCTAGCAGATTTGGAATCACAAGGAAGACCCACTGGTTGGAAACAATGTGGATCACTGCTATTGGCTCGAACCAGGGACCGCATGACTGTCTATAGAAGAATGAAGAGTCAATCAGTCTCCTGGGGCATTGATTGTGAACTCGTCAGTCCTAAGAGATGTCAAGATATATGGCCAATGCTGAACGTTGATGATGTGCTTGGTGGTCTATGGATACCTGGGGATGGAGTGGGTGACACACATTTATTTTGCATGTCCCTAATGAGAGCTGCTGTGGAAAATGGAGTTGGTGTTATGGAGGATTGTTCAATAAAGGCTGTACACTCTAAAGATGGTAAGGTATCTGGTGTGGATACCACAAGTGGTTCAATTGAATGCCAGTATTTTGTCAACTGTGCTGGTTTTTGGGCTAGACAGATTGGACAGCTAGCGAAGCCACAAGTTAAAGTTCCCTTGCTTCCGTGCGAACACTACTATTTGCATACAAAGCAGATTGAAAACTTGAGTCCTATGATGCCAGTTGTCCGTGATCCAGATGGCTTCATTTACTTACGAGAGAGGAATGGCTGTATACTGGCTGGGGGTTTTGAACCAATTGCAAAACCAGTGTATGAAGAAGAAATTCTCAATGCAGCCCAACGGTGTGTCGCTGAAGATTGGGACCATTTCCACATTTTATTGCAAGAATTGCTGAAACGTGTTCCCAGTCTCAATCAGGCTGTTTTACACAAACTTTGCAATGGACTCGAGGCATTTTCACCGGATTGTAAATGGATCGTAGGAGAATCACCTGAGATGGCAAACTACCATGTTGCTTGCGGCATGAAGACAGTTGGGATATCAGCAGCTGGAGGCGTCGCTGAAGCTACTGTAGATGAGATCGTTGATGGATATTCAAAGTATGATATGTACGAACTAGATATAAATAGATTTCTCGGTCTGCATAATAATAAACGGTTTCTGAGGGATAGGATGAAGGAAGTTCCAGGTGTTCATTATGGATTACCGTATCCATTTTACGAATTCGAAACTGGGCGGAATCTTCGACTGTCTCCGATTTATCAAACCCTAAGGGATAAGGGTGCTACGTTTGGGCAAGTGATGGGATATGAAAGACCAACTTGGTTCGAACCAGTCGATACAAGTACAGATTCTGATAAACCGAGACCGTTTAAAATAGCTCATACAAAAACATTTGGTAAACCGCATTGGTTCGATACGGTGCAAAGCGAATACTGGTCCTGTCGCGAGTCCGTAGGATTAGCTGATTATTCTTCGTTTACAAAAATAGACATACAGTCCCAAGGAACCGAAGTTGTGGATTTACTTCAATATTTATGTTCAAATGACGTGAATGTTCAAATCGGCAGCATCATTCATACTGGAATGCAAAATGAACGTGGGGGTTACGAGAACGATTGCAGCCTTGCTAGAATTGCAGAAAATCATTACATGATGATAGCACCAACAATCCAACAAACTAGGTGTAAAGTTTGGCTTCAAAGACATTTACCTAAAAATGGTAGTGTGACCTTATCGGATGTCACTTCAATGTATACAGCTATTTGCATATTAGGACCGTTCACTAGAAGTCTCCTGTCTGAATTGACGGATACAGATTTATCGCCATCAAATTTTCCATTCTTCACGTTTAAAGAGTTGGACGTAGGTCTAGCTAATGGTATCCGAGCGATGAATTTGACCCACACTGGGGAATTGGGTTATGTTCTATATATTCCAAATGAGTTCGCTCTCCACGTTTATCATCGTTTGTTGACCGTTGGCGAGAAATTTGGTATAAGACATGTTGGACACTACGCAACAAGGGCGCTGCGTGTAGAGAAGTTTTTTGCCTTCTGGGGACAAGACCTCGACACTATGACCACGCCATTAGAATGTGGACGAACTTGGCGCGTCAAATTCGATAAAAATATACCATTCATCGGCCGTGACGCCTTGCTACGTCAGAAAGAAGAAGGTATAAGGAGACAATACGTTCAGCTCCTGCTGACAGATCACGACCACGAGATGGATCTGTGGTCGTGGGGCGGTGAGCCGATATACAGGGACGGTGACTATTGCGGCCAAACTACAACCACCAGCTACGGATATACCTTTAAGAAACAGGTCTGCCTCGGCTTCATACAAAACTTGGATAAAAATGGTACAGAACAAAGGGTTACTAACGATTACGTTCTCAGTGGACACTACGAAATTGATATCGCTGGCATACGTTATGCAGCGAAAGTGAACCTACATTCACCAAATCTACCTACCAAATATCCAGACAAAGAGCGAGACGTTTACCAAGCAACGAGAAAACAACACGAACATCAATATATGGGGCGTCATTATCAACCTTAA

Protein sequence:

>DPOGS203960-PA
MGAAVAYHLARRGWGPHTVVIEKEKVGAGSRWHSSGLVGAFKPTLAQVRIAQSSIRLLADLESQGRPTGWKQCGSLLLARTRDRMTVYRRMKSQSVSWGIDCELVSPKRCQDIWPMLNVDDVLGGLWIPGDGVGDTHLFCMSLMRAAVENGVGVMEDCSIKAVHSKDGKVSGVDTTSGSIECQYFVNCAGFWARQIGQLAKPQVKVPLLPCEHYYLHTKQIENLSPMMPVVRDPDGFIYLRERNGCILAGGFEPIAKPVYEEEILNAAQRCVAEDWDHFHILLQELLKRVPSLNQAVLHKLCNGLEAFSPDCKWIVGESPEMANYHVACGMKTVGISAAGGVAEATVDEIVDGYSKYDMYELDINRFLGLHNNKRFLRDRMKEVPGVHYGLPYPFYEFETGRNLRLSPIYQTLRDKGATFGQVMGYERPTWFEPVDTSTDSDKPRPFKIAHTKTFGKPHWFDTVQSEYWSCRESVGLADYSSFTKIDIQSQGTEVVDLLQYLCSNDVNVQIGSIIHTGMQNERGGYENDCSLARIAENHYMMIAPTIQQTRCKVWLQRHLPKNGSVTLSDVTSMYTAICILGPFTRSLLSELTDTDLSPSNFPFFTFKELDVGLANGIRAMNLTHTGELGYVLYIPNEFALHVYHRLLTVGEKFGIRHVGHYATRALRVEKFFAFWGQDLDTMTTPLECGRTWRVKFDKNIPFIGRDALLRQKEEGIRRQYVQLLLTDHDHEMDLWSWGGEPIYRDGDYCGQTTTTSYGYTFKKQVCLGFIQNLDKNGTEQRVTNDYVLSGHYEIDIAGIRYAAKVNLHSPNLPTKYPDKERDVYQATRKQHEHQYMGRHYQP-