Monarch geneset OGS2.0

DPOGS212235
TranscriptDPOGS212235-TA3750 bp
ProteinDPOGS212235-PA1249 aa
Genomic positionDPSCF300263 + 231797-243218
RNAseq coverage301x (Rank: top 37%)
Annotation
HeliconiusHMEL0168000.077.06% 
BombyxBGIBMGA004447-TA0.076.06% 
DrosophilaCG4752-PA0.061.08% 
EBI UniRef50UniRef50_Q8T5H10.058.84%AGAP001606-PA n=7 Tax=Eukaryota RepID=Q8T5H1_ANOGA
NCBI RefSeqXP_970328.10.064.07%PREDICTED: similar to RE08455p [Tribolium castaneum]
NCBI nr blastpgi|3838476510.064.98%PREDICTED: 5-oxoprolinase [Megachile rotundata]
NCBI nr blastxgi|1947548980.061.15%GF11895 [Drosophila ananassae]
Group
Gene OntologyGO:00038242e-121catalytic activity
GO:00167873.5e-96hydrolase activity
KEGG pathwaytca:6588840.0 
 K01469 (E3.5.2.9)maps-> Glutathione metabolism
InterPro domain[873-1215] IPR0036922e-121Hydantoinase B/oxoprolinase
[237-534] IPR0028213.5e-96Hydantoinase/oxoprolinase
[9-218] IPR0080401.3e-55Hydantoinaseoxoprolinase, N-terminal
Orthology groupMCL11863 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212235-TA
ATGAATAAACCAAGAGGATTTCAATTTGCAATTGATCGAGGAGGTACATTTACTGATGTATATGCTCAGTGTCCAAATGGCAAAGTGCGTGTTATGAAACTTTTGTCAGTAGATCCTCAGAACTATGATGATGCACCAAGAGAGGCTATTAGACGGATATTGCATGAGGAAACTGGTAATGCGGTAGATGAATGTGGAAAAGTCAATTCATCTCTCATAGAATCTATTAGGATGGGTACAACAGTCGCAACCAATGCTCTATTGGAAAGGAAGGGAGCTAAAATGGCTCTTGTCATTAACAAAGGATTTAAAGATCTTTTATTTATTGGAAATCAGGCACGGCCAAATATATTTGATTTGAATATAAGACGTCCTGGTGTTCTTTACAAAGAGGTTGTGGAAATCGATTGTCGTGTGATACCAGCATTAGAAGATCGCTGTGAGATTGACAAGTCTAAATTGGATTGGAAGGAAGTCATAGGAACTACTGGTCAGAAGATGTTGGTTATTAAAGACATTGATGAGGAAGCGGTGCGGAGAGATCTGGCGACTTTAAGAGAGAAAGGGATAGACAGTATAGCTGTGGTACTAGCTCATAGCTATACATACAGGGATCACGAATTGAAGATTGGAAGAATTGCTGAGGAATTAGGTTTCAACCAGGTGTCTTTGTCCCATGCGGTGATGTCTATGGTGAGGATGGTTCCCAGAGGCTATACAGCATCAGCAGACGCGTACCTCACTCCCCACATCCGGGAGTACGTGACCAGTTTCTCCAGCGGCTTCACTGATGGTCTGAAGAACACCAACGTGCTCTTCATGCAGTCGGATGGCGGACTTACGCCTATGAACATGTTTAATGGTTCGCGTGCTATACTGTCCGGACCGGCCGGCGGTGTGGTGGGCTACGCACTAACTTCATACAACAAACAAACCGGTCTACCAGTTATAGGTTTCGACATGGGCGGGACGTCGACTGACGTGTCTCGGTACGCGGGATCCTTGGAGCACGTGCACGAGGCCACGACCGCCGGGGTCACCATACAGGCGCCGCAGCTAGATATAAACACGGTGGCAGCCGGCGGTGGGTCTGTGCTGTCGTTCCTCTCGGGCCTGTTCTCTGCAGGGCCCGAGTCCGCTGGGGCCCACCCCGGGCCCGCGTGCTACAGGCGAGGGGGCCCGCTCACGGTCACGGACGCTAACTTACTGCTAGGTCGCCTGCGGCCTGATTATTTTCCAAAAATTTTCGGTCCGGCTGAGAACGAGCCCTTGGATACAGAAGCGACGCTCGCCGGCTTTAAGAAAATGACGGCAGAAATCAATGCATTCCTCAAACAGGAAGGAAACAAAGAGATGACGTTGGAGGAGGTGGCCATGGGCTTCATCAACGTCGCCAACGAGGCCATGTGCCGGCCGATCAGGTCATTGACCACAGCCCGGGGACACGATGCCCGGGCGCACGCGCTCGCATGCTTTGGGGGCGCGGGGGGCCAACACGCCTGCAGCGTGGCCCGCCGCCTGGGGATAAAGACGGTGCTCATACACAAGTACGCCGGTATCCTGTCAGCGTACGGCATGGCTCTAGCGGACGTGGTCCAAGAGGAACAGACGCCGTGTGCTGATGTGTACCGACCAGAAAACTATCGACATCTGGACGAACAGATCGATAGACTCTCCGCTGTATGTAAGGAGAAGCTTAGAGCTCAGGGCTTCACCGACGACCAGATAACACTAGAGCCGTACCTCCACTTGAGGTATTCCGGAACCGACTGTGCCCTCATGGTTTCTCCCATACAGGGCGACCAGGCTACCAGACATGGCGACTTCTACGCAGCCTTTGTTAACAGATATAAGAACGAGTTTGGCTTCACGCTATCTAACCGGGAGGTGGTAGTGGATGACGTCAGAGTGAGAGGTGTTGGAAGAAGTGGGGGAAGAAAAGAGACGGCGCTAGAGGTGGAGAAGGAAACGACACCCGCCGTTGATAATGTCGTCCAAGTCTACTTCGAAGGCGGTTATCAGAACACTGCTATATATCCGCTGGAGAAACTATCACCAGGTCATAAAATTCCAGGTCCGGCCATTATAATGGACAAATTGTCTACTATACTTGTGGAGCCGGGATGTACAGCAGATATTACTAAATACGGCGACGTGAGTATCACTATAGGCTCTGGTCAGAAGACGGAAGTGACGTCACAACTGGACTCGGTCCAGCTCAGCATCTTCTCTCACAGATTCATGTCGATAGCTGAGCAGATGGGGAGGGTTCTGCAACGTACATCAATATCAGTGAATATAAAGGAGCGTCTGGACTTCTCGTGCGCGCTGTTCGGGGCGGACGGCGGGCTCGTCTCTAACGCGCCACACATACCGGTACACCTGGGGGCCATGCAGGAGACGGTGCAGTATCAGATGAAGGTCCGCGGCTCGTCTCTGCAGCCCGGGGACGTGCTGCTGTCCAACCACCCCCGGGCTGGCGGCTCCCACCTCCCGGACCTCACCGTCATCACCCCGGTCTTCCACGAGTATGATCGTATACACCAGAATACTATACAGCAGTATCCTTACAGATATATAAAGCTGATGAAGCCAGGTCAAGTCCCCGGGTGCTCCGGGACGAGGAATCTAGCGGACAACCTCTCAGATCTGAAGGCTCAAGTCGCCGCCAACCAAAGGGGCATACAACTGGTGTCCGAGCTGATAGAAGAATACAGCCTCGACGTGGTCCAAGCTTACATGACTCATATACAGAAGAACGCTGAACTAGCCGTTAGGGAAATGTTGAAGCAAATAGCGGAGAAGACAATCAAGAAGACGGGCTCATGTGTTCTGAAAGCCACAGAGTATTTGGACAACGGTGCACCAATCGCTTTGACGGTTACACTGGACCCCAGCACTGGCGGAGCTATCTGTGACTTCACTGGCACCGGCGTGGAGGTGTGGGGTAACTTGAACGCCCCTCGCGCCATAACTATGTCCGCTATCATTTACTGTCTGCGGTGTATGGTGGGCAGAGATATACCGCTCAACCAGGGGTGTCTGAATCCCGTGACCGTTATAATACCTCGTGGTAGTTTACTGGACCCCAGCGACTCAGCCGCTGTGGTCGCCGGGAACGTGCTCACGTCACAGAGGCTCGTGGACGTCATCCTCAAAGCCTTCCAGGTTTGTGCCGCCTCTCAAGGTTGTACCAACAATTTGACACTCGGCGAGACCACCTGGGGATATTACGAGACGGTGGCAGGCGGCAGCGGAGCGGGTCCGGGCTGGCACGGGGCGTCGGGAGTTCACACACATATAACGAACACACGCATCACGGACGTGGAGATAGTCGAAACGAGATACCCCATGATCGTGACCAACTTCTCACTGAGGAGCGGCTCCGGGGGACGGGGTAAATGGCGCGGCGGGGACGGCGTGACCCGCGAGCTGGTGTTCCGACGCACTGTGCAGGTGTCCGTCCTCACCGAACGGAGAGCCTTCCAGCCGTACGGAATGAACGGAGGGGAACCTGGCGCTAGAGGTCTGAACCTGCTCCAGCGAGCTGACGGGAGACTAATTAATCTCGGAGGAAAATCCTCAGTTACAGCGTCTCCTGGAGATAAATACATCATGAATTCGCCGGGCGGAGGTGGCTACGGTCGACCGTTAGGTGATGAGACAGGCGAACAAACAGACATACAACACAATGAGTTCGTGGAGAGAGGAAGCGTCTTCGAGTATAGAAGCGCCCAAGAATCAGTTTAG

Protein sequence:

>DPOGS212235-PA
MNKPRGFQFAIDRGGTFTDVYAQCPNGKVRVMKLLSVDPQNYDDAPREAIRRILHEETGNAVDECGKVNSSLIESIRMGTTVATNALLERKGAKMALVINKGFKDLLFIGNQARPNIFDLNIRRPGVLYKEVVEIDCRVIPALEDRCEIDKSKLDWKEVIGTTGQKMLVIKDIDEEAVRRDLATLREKGIDSIAVVLAHSYTYRDHELKIGRIAEELGFNQVSLSHAVMSMVRMVPRGYTASADAYLTPHIREYVTSFSSGFTDGLKNTNVLFMQSDGGLTPMNMFNGSRAILSGPAGGVVGYALTSYNKQTGLPVIGFDMGGTSTDVSRYAGSLEHVHEATTAGVTIQAPQLDINTVAAGGGSVLSFLSGLFSAGPESAGAHPGPACYRRGGPLTVTDANLLLGRLRPDYFPKIFGPAENEPLDTEATLAGFKKMTAEINAFLKQEGNKEMTLEEVAMGFINVANEAMCRPIRSLTTARGHDARAHALACFGGAGGQHACSVARRLGIKTVLIHKYAGILSAYGMALADVVQEEQTPCADVYRPENYRHLDEQIDRLSAVCKEKLRAQGFTDDQITLEPYLHLRYSGTDCALMVSPIQGDQATRHGDFYAAFVNRYKNEFGFTLSNREVVVDDVRVRGVGRSGGRKETALEVEKETTPAVDNVVQVYFEGGYQNTAIYPLEKLSPGHKIPGPAIIMDKLSTILVEPGCTADITKYGDVSITIGSGQKTEVTSQLDSVQLSIFSHRFMSIAEQMGRVLQRTSISVNIKERLDFSCALFGADGGLVSNAPHIPVHLGAMQETVQYQMKVRGSSLQPGDVLLSNHPRAGGSHLPDLTVITPVFHEYDRIHQNTIQQYPYRYIKLMKPGQVPGCSGTRNLADNLSDLKAQVAANQRGIQLVSELIEEYSLDVVQAYMTHIQKNAELAVREMLKQIAEKTIKKTGSCVLKATEYLDNGAPIALTVTLDPSTGGAICDFTGTGVEVWGNLNAPRAITMSAIIYCLRCMVGRDIPLNQGCLNPVTVIIPRGSLLDPSDSAAVVAGNVLTSQRLVDVILKAFQVCAASQGCTNNLTLGETTWGYYETVAGGSGAGPGWHGASGVHTHITNTRITDVEIVETRYPMIVTNFSLRSGSGGRGKWRGGDGVTRELVFRRTVQVSVLTERRAFQPYGMNGGEPGARGLNLLQRADGRLINLGGKSSVTASPGDKYIMNSPGGGGYGRPLGDETGEQTDIQHNEFVERGSVFEYRSAQESV-