Monarch geneset OGS2.0

DPOGS201807
Transcript: DPOGS201807-TA (1737 bp)
Protein: DPOGS201807-PA (578 aa)
Genomic position: DPSCF300145 + 116672-130147
RNAseq coverage: 645x (Rank: top 20%)
Annotation
Heliconius        HMEL003569          0.0    95.16%
Bombyx            BGIBMGA013235-TA    0.0    84.87%
Drosophila        slgA-PE             0.0    67.19%
EBI UniRef50      UniRef50_B3MRV8     0.0    66.84%    GF20923 n=22 Tax=Bilateria RepID=B3MRV8_DROAN
NCBI RefSeq       NP_996527.1         0.0    68.63%    sluggish A, isoform G [Drosophila melanogaster]
NCBI nr blastp    gi|24643717         0.0    68.63%    sluggish A, isoform A [Drosophila melanogaster]
NCBI nr blastx    gi|24643717         0.0    68.63%    sluggish A, isoform A [Drosophila melanogaster]
Group
Gene Ontology     GO:0004657    1.6e-225    proline dehydrogenase activity
                  GO:0055114    1.6e-225    oxidation-reduction process
                  GO:0006562    3.8e-102    proline catabolic process
                  GO:0006537    3.8e-102    glutamate biosynthetic process
KEGG pathway      dpo:Dpse_GA12802    0.0
                  K00318 (E1.5.99.8)    maps-> Arginine and proline metabolism
InterPro domain   [22-572]     IPR015659    1.6e-225    Proline oxidase
                  [205-551]    IPR002872    3.8e-102    Proline dehydrogenase
Orthology group   MCL12426     Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201807-TA
ATGGCTCTGCTACGTCGACTGGCTGTGAACGCGCCCCGAGGCGTCCGAGTTTTGTCCACGCCGCCACCGTCTCGTGACGAACTAGATCTAACCTTCAACAGTCCGAGAGATGCTTTCAAGAGCAAGAAAACTAGCGAATTGGTCCGGGCGTACCTCGTATATCAAATATGTTCGATCAACTGGGTTGTCGAGAACAATGCTATGCTGATGAAACGCCTCCGCCAGCTGGTCGGTCAGAGGCTGTTCGAAGCCATCATGAAGGCCACCTTCTACGGCCAGTTCGTCGCCGGCGAGGACCAGATCAAGATACAACCGACGCTTGACAGGCTGCGGTCGTTCGGTGTAAAGCCGATCCTCGATTATTCCGTGGAGGAAGATCTCTCCCAGGAGGAGGCTGAGAAGCGCGAAGTGAGCGCTTCGATATCGACGTGCGGCGACACGCAGGAGGAGGGTCAACTGAAGCAGTACCACGTGGAGCAGAGATTCGCTGATCGCCGGTACAAGGTCACCAGCGCTAGAACATACTTCTACCTGAACGAGGCCTCATGCGAGAAGAACATGGAAGCGTTTATGAACAGCATCGACACCGTCGCCAAAATAACCAAGAGCACTGGACTTATGGCCGTGAAACTAACAGCCCTTGGCAGACCACAGTTACTTCTCCAACTGTCCGAGGTGATAATGCGCGCCCGTAGCTATATGCAGCAGATAGCTGGCGGTACTGGGAACGTACTCGCCCATCATAAGACCATCGAAGACCTGCAGAGATACTTAGGGGATTACAGCGCTCGGCCCGAAGTACAGGACTTTATGAACAAAGTCACCTCCGACACGGAAGGTATCGTCCATCTTTTCCCGTGGTCGAACATTCTGGATAAGGATATGGGTTTGTCAGATTCATTCCGCGTCCCTGACCCGAAGACCGGTCAGATGCGACGCCTCATCTCCCAGATATCGCCCAAGGAGGAGGAAATGTTCAGGAACATGCTGCGGCGTCTCAACAATATAATACAGGTGGCCAACGAGCATGACGTCAGGATTATGATAGACGCCGAACAGACATACTTTCAGCCGGCCATCTCGAGGATCTGTCTCGAAATGATGAGGAGGTATAACAAGAACAAATTCCTCGTATTCAATACATACCAGACCTATCTGAAGAACACGTACAACGAGATAGTGACTGATCTCGAACAGGCGCAGCGTCAGAACTTCTACTGGGGTGCCAAGCTGGTCCGGGGGGCCTACATAGAGCAGGAGCGTGCCCGTTCAGCCGCTATGGGCTACGAGGATCCCACGTGTGAGAGCGTCGACGCTACGACAGCATCATTCCACCGCTGTCTCAAGGAAATACTCAGCCGGGTTAAGAACGAGCAAAACGATCGTCTCGGTATAATGGTGGCCTCTCACAATGAGGACACCGTCCGTTATGCCATCCAGTTAATGAAGGAACACGGCATCGGGCCGGGGGATAAGGTGGTGTGCTTCGGGCAACTGCTGGGGATGTGTGATCACATCACATTCCCATTGGGTCAAGCTGGTTATTCGGCTTATAAGTATGTTCCTTACGGTCCTGTGCTGGAAGTGCTGCCATACTTGTCCCGTCGAGCAAATGAGAACAGAGGCTTCCTCCAGAAGATAAAGAAGGAGAAGGGTCTGCTTCTAAAAGAGATATTCCGTAGAATGTTCAGCGGACAGCTGTTCTACAAACCGTCTGGGAACTATACACCGGTTTAA

Protein sequence:

>DPOGS201807-PA
MALLRRLAVNAPRGVRVLSTPPPSRDELDLTFNSPRDAFKSKKTSELVRAYLVYQICSINWVVENNAMLMKRLRQLVGQRLFEAIMKATFYGQFVAGEDQIKIQPTLDRLRSFGVKPILDYSVEEDLSQEEAEKREVSASISTCGDTQEEGQLKQYHVEQRFADRRYKVTSARTYFYLNEASCEKNMEAFMNSIDTVAKITKSTGLMAVKLTALGRPQLLLQLSEVIMRARSYMQQIAGGTGNVLAHHKTIEDLQRYLGDYSARPEVQDFMNKVTSDTEGIVHLFPWSNILDKDMGLSDSFRVPDPKTGQMRRLISQISPKEEEMFRNMLRRLNNIIQVANEHDVRIMIDAEQTYFQPAISRICLEMMRRYNKNKFLVFNTYQTYLKNTYNEIVTDLEQAQRQNFYWGAKLVRGAYIEQERARSAAMGYEDPTCESVDATTASFHRCLKEILSRVKNEQNDRLGIMVASHNEDTVRYAIQLMKEHGIGPGDKVVCFGQLLGMCDHITFPLGQAGYSAYKYVPYGPVLEVLPYLSRRANENRGFLQKIKKEKGLLLKEIFRRMFSGQLFYKPSGNYTPV-
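
Length check:

The transcript and protein lengths listed in this record can be cross-checked with simple arithmetic. The sketch below assumes the transcript is a complete coding sequence with no untranslated regions and that it ends in a stop codon (the nucleotide sequence above starts with ATG and ends with TAA, and the protein sequence ends with "-"). The function name `expected_protein_length` is illustrative and not part of any database tool.

```python
def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a coding sequence.

    Assumes cds_bp counts only coding bases (no UTRs). If the sequence
    includes a terminal stop codon, that codon is not counted as a residue.
    """
    if cds_bp <= 0 or cds_bp % 3 != 0:
        raise ValueError("coding sequence length must be a positive multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from this record: 1737 bp transcript, 578 aa protein.
print(expected_protein_length(1737))  # 578
```

Under these assumptions, 1737 bp gives 579 codons, that is 578 residues plus the stop codon, which agrees with the 578 aa listed for DPOGS201807-PA.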