Monarch geneset OGS2.0

DPOGS212236
TranscriptDPOGS212236-TA1905 bp
ProteinDPOGS212236-PA634 aa
Genomic positionDPSCF300263 + 245595-257985
RNAseq coverage31x (Rank: top 75%)
Annotation
HeliconiusHMEL0168000.081.20% 
BombyxBGIBMGA004447-TA0.077.48% 
DrosophilaCG4752-PA0.065.98% 
EBI UniRef50UniRef50_Q8T5H10.063.60%AGAP001606-PA n=7 Tax=Eukaryota RepID=Q8T5H1_ANOGA
NCBI RefSeqXP_001361023.20.065.98%GA18405 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|1984583940.065.98%GA18405 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastxgi|1984583940.065.98%GA18405 [Drosophila pseudoobscura pseudoobscura]
Group
Gene OntologyGO:00038244.5e-158catalytic activity
KEGG pathwaydpo:Dpse_GA184050.0 
 K01469 (E3.5.2.9)maps-> Glutathione metabolism
InterPro domain[34-481] IPR0036924.5e-158Hydantoinase B/oxoprolinase
Orthology groupMCL11863 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212236-TA
ATGAACACGGAAGAGTCATCGAATCCTGCCAGTTTACTCCTTTGCTTGAAGTTGCTCTGTATAGAAGAACCATCGAACCCCCATTTGCTGATGAAGGTCCGCGGCTCGTCTCTGCGGCCCGGGGACGTGCTGCTGTCCAACCACCCCCGGGCTGGCGGCTCCCACCTCCCGGACCTCACCGTCATCACCCCGGTCTTCCACGAGTCGAGTGCTCTGCCAATCTTCTTCGTGGCTTCTCGCGGTCACCACGCGGACATCGGTGGCCTGACGCCGGGGTCCATGCCGCCGCACTCCACCAGTCTCCAACAAGAGGGAGCCTCCTTCAAATCCATGATGCTGGTGCAGCGAGGAGTCTTCAATGAGAAGGAGTTGGTTGAAGAGCTGATGAAGCCAGGTCAAGTCCCCGGGTGCTCCGGGACGAGGAATTTAGCGGACAACCTCTCAGATCTGAAGGCTCAAGTCGCCGCCAACCAGAGGGGCATACAACTGGTGTCCGAGCTGATAGAAGAATACAGCCTCGACGTGGTCCAAGCTTACATGACTCATATACAGAAGAACGCTGAACTAGCCGTTAGGGAAATGTTGAAGCAAATAGCGGAGAAGACAATCAAGAAGACGGGCTCATGTGTTCTGAAAGCCACAGAGTATTTGGACAACGGTGCACCAATCGCTTTGACGGTTACACTGGACCCCAGCACTGGCGGAGCTATCTGTGACTTCACTGGCACCGGCGTGGAGGTGTGGGGTAACTTGAACGCCCCTCGCGCCATAACTATGTCCGCTATCATTTACTGTCTGCGGTGTATGGTGGGCAGAGATATACCGCTCAACCAGGGGTGTCTGAATCCCGTGACCGTTATAATACCTCGTGGTAGTTTACTGGACCCCAGCGACTCAGCCGCTGTGGTCGCCGGGAACGTGCTCACGTCACAGAGGCTCGTGGACGTCATCCTCAAAGCCTTCCAGGTTTGTGCCGCCTCTCAAGGTTGTACCAACAATTTGACACTCGGCGAGACCACCTGGGGATATTACGAGACGGTGGCAGGCGGCAGCGGAGCGGGTCCGGGCTGGCACGGGGCGTCGGGAGTTCACACACATATAACGAACACACGCATCACGGACGTGGAGATAGTCGAAACGAGATACCCCATGATCGTGACCAACTTCTCACTGAGGAGCGGCTCCGGGGGACGGGGTAAATGGCGCGGCGGGGACGGCGTGACCCGCGAGCTGGTGTTCCGACGCACTGTGCAGGTGTCCGTCCTCACCGAACGGAGAGCCTTCCAGCCGTACGGAATGAACGGAGGGGAACCTGGCGCTAGAGGTCTGAACCTGCTCCAGCGAGCTGACGGGAGACTAATTAATCTCGGAGGAAAATCCTCAGTTACAGCGTCTCCTGGAGATAAATACATCATGAATTCGCCGGGCGGAGGTGGCTACGGTCGACCGTTAGGTGATGAGACAGGCGAACAAACAGACATACAACACAATGAGTTCGTGGAGAGAGGAAGCGTCTTCGAGTATAGAAGCGCCCATCTTAGTCAAATCTACTTCGTTAGTTTTATGAACCATGTCTTACTCCTAGCCTTGTTTGAACAAGCTGACGCCAGTAAAAATGAACAGTGCAGTTCAGACGACGAGGGTGAAATCGATCATATTTCAGAGCGAAGTCAAGGCAGTGATACGGAACAGGAATGCAGCGATGATGAGCAAAAATCCACCATTTTATTACGAAGTCGAGACCTACTTGATGTAGTGCAAGGCGCATGCGTGAAACCGGAAGATGTAGTCGGAAAAGCAATATGGGAGAAGCAAGATGCAAAAGCACAAACATGGCTTGTGACAAGGATGTCGGAAAATGCGATGATGCAGATTTTAACATGTTCGACATCTGCGGAAATGTGA

Protein sequence:

>DPOGS212236-PA
MNTEESSNPASLLLCLKLLCIEEPSNPHLLMKVRGSSLRPGDVLLSNHPRAGGSHLPDLTVITPVFHESSALPIFFVASRGHHADIGGLTPGSMPPHSTSLQQEGASFKSMMLVQRGVFNEKELVEELMKPGQVPGCSGTRNLADNLSDLKAQVAANQRGIQLVSELIEEYSLDVVQAYMTHIQKNAELAVREMLKQIAEKTIKKTGSCVLKATEYLDNGAPIALTVTLDPSTGGAICDFTGTGVEVWGNLNAPRAITMSAIIYCLRCMVGRDIPLNQGCLNPVTVIIPRGSLLDPSDSAAVVAGNVLTSQRLVDVILKAFQVCAASQGCTNNLTLGETTWGYYETVAGGSGAGPGWHGASGVHTHITNTRITDVEIVETRYPMIVTNFSLRSGSGGRGKWRGGDGVTRELVFRRTVQVSVLTERRAFQPYGMNGGEPGARGLNLLQRADGRLINLGGKSSVTASPGDKYIMNSPGGGGYGRPLGDETGEQTDIQHNEFVERGSVFEYRSAHLSQIYFVSFMNHVLLLALFEQADASKNEQCSSDDEGEIDHISERSQGSDTEQECSDDEQKSTILLRSRDLLDVVQGACVKPEDVVGKAIWEKQDAKAQTWLVTRMSENAMMQILTCSTSAEM-