Monarch geneset OGS2.0

DPOGS214248
Transcript: DPOGS214248-TA (3996 bp)
Protein: DPOGS214248-PA (1331 aa)
Genomic position: DPSCF300014 + 1290678-1300579
RNAseq coverage: 448x (Rank: top 27%)
Annotation
Heliconius: HMEL004440, 0.0, 78.19%
Bombyx: BGIBMGA005967-TA, 0.0, 69.57%
Drosophila: CG1347-PB, 3e-117, 37.39%
EBI UniRef50: UniRef50_Q7PWR9, 3e-147, 30.82%, AGAP008873-PA (Fragment) n=1 Tax=Anopheles gambiae RepID=Q7PWR9_ANOGA
NCBI RefSeq: XP_319618.4, 6e-148, 30.82%, AGAP008873-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|158299500, 1e-146, 30.82%, AGAP008873-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|170040675, 5e-155, 30.01%, conserved hypothetical protein [Culex quinquefasciatus]
Group
KEGG pathway 
InterPro domain: [1102-1195] IPR019460, 4.2e-12, Autophagy-related protein 11
Orthology group: MCL13245, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214248-TA
ATGTTGTATGTATTCCATGTTGACGCCGGTCAAATGACCACTTATGACATGGAGCTCACTTTACAAAGTGTTGCAAGCTTAAAAGCAGCTATAGAGAGAAAGACGAAAATACCGTCATCCTCTCTAGTGCTCCTCATAAGTGGGGGAGAGGTACTGCAGTCGGATCACATGGTATCCTCATATAGTGCAGGGACGGATTCTAACCCTATATATATGTTTAGTAAACCATCTGTTAAAGAAAGCCATTTGAAACAGTCCATGTGTGATTTGAGTCCAATAGTCGAGCTCAGTACTGGTGAATTTCGTAGTGATATTCGGAGTGCTTTCGATGGCAAGTCAGTGGCAGAATTAAAGAATGCCGTAGAATTATGTTGTTCACTCCCACCTAATATACACACTGTGATATCATATGCTACATTGGCCCAGCAGTTTGGTGACTTAGCTCATGAGGTGTCAAGGAGTTGTGATCAGTTGGTCCATGAACAGCATTTACAGCATCAGGGTTGGTCAGCGGTCATTGCTAATTTAGAGGACATATTCAATGAGTTCTGTGAGAGGTCGAGAAGTTTCAAAGAGTCATTCAGGAAACACAGACTAAAAAAGGAAGAGTATCACGAGAAACTAAACACATTGAATGAAGTACTGGAGTCATTAGGGAAAATACCAATTTTGCCTGCTCTTCAGTTAAATGCAGAGGCTCATAGGTTTTCCGCTTTTGATGTATTTGAAGAGACGGACTTTGAAGGTCATCATTATGGAGCTAAGGAGACTTTGGATAGCAGCTCAGAGAGAACTGCAGATTTTGGAGTTGGGGTGTTTAAGTTATCTGAGGAAGATGTATTTGAGCAACCATCAACAAGTAATGAGGGGGCCTCAGATGAGGCCCCAGATAAAATGACAGAGGGGGCCTCTTCTTCTGGTCAGTCAGAGGGCGCCACATTCAAGGTCAATGAGGAACATACATTGTTACACTGGATCTTGGCTCAAGGGAACCAGGCATCACTTCAGGACATATTGGATTACTGTCAGAAAGGACTCGCTCTGATCGACGCAGAGCCTCTGAAGGAGAGGGAGGCGGAGCTATACAATATCTTGGAGTACGCTAATGTTCACGGCTTGAAACAAATCAAAGGCATCGAGGATAGGCTGTACGCGTTGGAACAGCTGCTGAGTGACGTCAAGAAGAAGGAGAACGAGCAGCACGTGGTCGCCGCCTCGCTCATACAGCACCGGGATCGTCTGAACTCCGGCTGCGACCCGTCCGTCCTGCCGACGTTGTTGGAGTCCAACCAGTGTCAGTTGGGGAGACTGTTGAAGGGACACGAGCTGCTGGTCGATATTAGAAGGAGGATCATGAAGTCCAAGGACGAGTTGTCCAGGATACTGAAAGCGAGATTAGAGGCCGTGCTGGTGATAGAGAACTCGATGTCGGTGCAGGACGCGCACGCGATGCTGTCGTTCCAGTGTTTCAACCGTCTCGCTCGTTACTTCGGTATAGTGGCTCAGCTCCACCGAGCTCCGGCCGTATTCGTGAGAGCTGTCCACGAGGTCGCCAGGAGGAGGACCTTCTCACAACACCATCTCCAGTGGGCTACCGATCTCGCCAGCAAGCTCATAAAGATACACGAGGAGGAGATCAGCAGACGCCAGGAGTTCAACTCTCACTTCGAAGATCACTTCCTGAAGAGCCTCTTCCCCGGCATGACGGACCTGCCTCCCCCATTCGCTACACAGGCGCCATCTCTATACGACTCACGCCTGCCCGAACTCACTGATACGGATGTAGAGTATATATCAGAAGCTCTACCCGACTGGACTAGTGATGTACCCAAATACGATATGGAATCCACCGTTAAATTCTTCCAGCAAAGGCTCAATACATCTGACCACGAAGATAAAGACGCCGATGTTCAAGTAGATTTTGATAAAGATTTTGAATCAGAAACCGACACGGATTTTGAGAAGCTGAGTCGTCAGAGCGAAAAGCAGAAGAATGATAT
ATCTACGAGCTGCGTCCCTCACACGATGGCCGTCTCTACAGTGACCGAGGTTGGGACTCTACCCGTTATACCGGAAAGCCCTAGAGTGGAGTTCCTTAACTCTGAATTTTATATAGAAGAATCTCTACCGTCCAGTCTGGAATGGGGCCGGGATGAACGACAGGACAATATGGACACTCACAAAATCAACATGGAGAAATTGCAAGATTTGTTCGTGAAGTTGTTTAATGTATGTAAAATAAATATTGTGTTGATAAAAGACGAGCTTACTAAGTTAAAGAGCGAAGTGGACGGTCAGAAGAAATTCATAAACACAAAATATCTAGAAATTACCGAGGCCTGGGAAAAGGTAAATGAACACGCTGAAACAAGATTCCGCGAGCAAACCCAGAGGCTGACGGTAGATCACGAATTAGAATTGAGTGATATGAAGGCGGCACTCAATGAAAAGGATGACGTCATCAGCAACCTGAAGAAAGAGACGGAAGATATGAAAATGGAACACCAGAAGGAGACGGAGAGGTTAGACAAAGAGCATAAAAGCACTAAGGAGTTGTTAGATGAAACTCGGAAAGAGATAAAAGCTTTTGAGAAGAAATTAGAGGAAGCTGAGGTTCAGAAACAAAAAGATATCAAAGAGATGCAGGAGAAGATGCATCTTGAATATAAAGCAGAGATAGAGTCGCTACGGTCGAGGTTCCGTCTCGTGGCTCTAACGAACAACATGGACAGGTCGCCGTCGGAGTCCAGCCTGGAGAAAATCGAGAGGACCGACGTCATAGAGATAGTCAGCCACAACGCTATACTGATGCAGACGAAGCAGAACGCTGAGGTGGAGAAGGAAGAAGCGGTCAAGGAGGCGGTGGAGAAATGTAAGGCGGAGTGGGAACAGAAGCTTAACGCTGAGATATGTCTGCTGAAAGCCAAGTATGAAGCTGAGAAGCAGGTGACGATAAACGACGTGACCCGTCGTCTTCTGTCAGAGAAGGATCGCCAGTTGGAACTGCTCCGGGAACGCGAGCAGACCCTCGTTCGCGAGTGCTGCAAATACAGGGACACTATACAACAACTCACTGATCCAGAGACCAACGACTACGATAGTCTCTTGAAGACTCAGTTTGCAACATTTGAAAACGAAAAGGCTGTGCTATTGCAACAAGTTGCAAGCTTGAAGGCGGAGTTAGAGAAGAAGACTGAGGAGGCGGACAAGAGGAGGGAGGAGGATAGTGACGGCAGGTCGTCTCCTCGTCGTGATATCCGTCGCCGGAGTCACACGCCGCTGGGTCTGTCTCCGGGCGCCCTGACCCTCGCCCTGGGCCAGTACCCCCAGGGTCACACCGTGCTGGTCATGTGGGACCCTGCGCATCTCAACTACACCGTACTACAGGAGGCGTCCATAATGCACTTCGTCCACAGCGACTGTCTGCCGTCCCTGGACCTTAGTATCCACGTGAAGAACGAGAGTGAGAGACGTTTGTATGCTGTGGCCACCGTGGAGTCCAAGGAATACTGCTACGCTAAGAGGGGTGTGAATAGATATCACATGCCGCGTGGATCTCGCTTCTATAGAGTCCACGTGAAGCCCCTCAAACCGCCGCTACCTCCGCCAGCCTGCTGTGATCACAAACACAAGCCTGACATGCAGAAGTCCATCGACACCAGCCAGTCGTCCAGCTCCAACGCCGATAAGACCGGTGTGGAGGTGGCCACGGCTACGCTCATCAACCTGGAGTCCCCCGTGTCTGCTGGGGAGCCTCCCGTGCCCATGATAGCGCCCGAAGACCAGCTCGACTCCATAGAGACGGAGCACAAACAACACAAGATGCAACTGTCTACAACCAGTGCTGTTTCCGAGATGGACCTCAGCGTGGGTCGTGTGGTGGGGGCGGAGGCTCCCGGGGCGGAGCCCGTGGAGCTGACGGTGAGCGCCGTGTCGGTGGTGGCGAGGGGCTCCGCGCCGCCCGGATCAGAATTGGCCGAAGAGGCCGCGCCCTGA

Protein sequence:

>DPOGS214248-PA
MLYVFHVDAGQMTTYDMELTLQSVASLKAAIERKTKIPSSSLVLLISGGEVLQSDHMVSSYSAGTDSNPIYMFSKPSVKESHLKQSMCDLSPIVELSTGEFRSDIRSAFDGKSVAELKNAVELCCSLPPNIHTVISYATLAQQFGDLAHEVSRSCDQLVHEQHLQHQGWSAVIANLEDIFNEFCERSRSFKESFRKHRLKKEEYHEKLNTLNEVLESLGKIPILPALQLNAEAHRFSAFDVFEETDFEGHHYGAKETLDSSSERTADFGVGVFKLSEEDVFEQPSTSNEGASDEAPDKMTEGASSSGQSEGATFKVNEEHTLLHWILAQGNQASLQDILDYCQKGLALIDAEPLKEREAELYNILEYANVHGLKQIKGIEDRLYALEQLLSDVKKKENEQHVVAASLIQHRDRLNSGCDPSVLPTLLESNQCQLGRLLKGHELLVDIRRRIMKSKDELSRILKARLEAVLVIENSMSVQDAHAMLSFQCFNRLARYFGIVAQLHRAPAVFVRAVHEVARRRTFSQHHLQWATDLASKLIKIHEEEISRRQEFNSHFEDHFLKSLFPGMTDLPPPFATQAPSLYDSRLPELTDTDVEYISEALPDWTSDVPKYDMESTVKFFQQRLNTSDHEDKDADVQVDFDKDFESETDTDFEKLSRQSEKQKNDISTSCVPHTMAVSTVTEVGTLPVIPESPRVEFLNSEFYIEESLPSSLEWGRDERQDNMDTHKINMEKLQDLFVKLFNVCKINIVLIKDELTKLKSEVDGQKKFINTKYLEITEAWEKVNEHAETRFREQTQRLTVDHELELSDMKAALNEKDDVISNLKKETEDMKMEHQKETERLDKEHKSTKELLDETRKEIKAFEKKLEEAEVQKQKDIKEMQEKMHLEYKAEIESLRSRFRLVALTNNMDRSPSESSLEKIERTDVIEIVSHNAILMQTKQNAEVEKEEAVKEAVEKCKAEWEQKLNAEICLLKAKYEAEKQVTINDVTRRLLSEKDRQLELLREREQTLVRECCKYRDTIQQLTDPETNDYDSLLKTQFATFENEKAVLLQQVASLKAELEKKTEEADKRREEDSDGRSSPRRDIRRRSHTPLGLSPGALTLALGQYPQGHTVLVMWDPAHLNYTVLQEASIMHFVHSDCLPSLDLSIHVKNESERRLYAVATVESKEYCYAKRGVNRYHMPRGSRFYRVHVKPLKPPLPPPACCDHKHKPDMQKSIDTSQSSSSNADKTGVEVATATLINLESPVSAGEPPVPMIAPEDQLDSIETEHKQHKMQLSTTSAVSEMDLSVGRVVGAEAPGAEPVELTVSAVSVVARGSAPPGSELAEEAAP-
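
The transcript and protein lengths in this record are consistent with each other: 1331 codons plus one stop codon give 3 x 1331 + 3 = 3996 bp, which suggests the listed transcript is coding sequence only. The following is a minimal sketch of how that relationship, and the translation itself, could be checked. It assumes the standard genetic code (NCBI translation table 1) and reading frame 1. The function names `translate` and `cds_length` are illustrative and not part of the database. The stop codon is written as "-" to match the protein sequence above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Assumed here, not stated in the record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides: str, stop_symbol: str = "-") -> str:
    """Translate a DNA string in frame 1, writing stop codons as stop_symbol."""
    # Remove whitespace and line breaks so wrapped FASTA bodies can be pasted in.
    seq = "".join(nucleotides.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def cds_length(n_residues: int) -> int:
    """Length in bp of a coding sequence with n_residues codons plus one stop codon."""
    return 3 * n_residues + 3


if __name__ == "__main__":
    # First 30 nt and last 15 nt of DPOGS214248-TA, copied from the record above.
    print(translate("ATGTTGTATGTATTCCATGTTGACGCCGGT"))
    print(translate("GAGGCCGCGCCCTGA"))
    print(cds_length(1331))
```

Pasting the full DPOGS214248-TA sequence into `translate` should, under the same assumptions, reproduce the DPOGS214248-PA sequence including the trailing "-".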