Monarch geneset OGS2.0

DPOGS204068
TranscriptDPOGS204068-TA4548 bp
ProteinDPOGS204068-PA1515 aa
Genomic positionDPSCF300200 + 28383-34108
RNAseq coverage137x (Rank: top 55%)
Annotation
HeliconiusHMEL0131320.064.01% 
BombyxBGIBMGA010808-TA0.048.42% 
DrosophilaSpargel-PB4e-3047.55% 
EBI UniRef50UniRef50_UPI0002064CDD2e-3143.53%UPI0002064CDD related cluster n=1 Tax=unknown RepID=UPI0002064CDD
NCBI RefSeqXP_001122841.13e-3142.94%PREDICTED: similar to CG9809-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3800235804e-3143.53%PREDICTED: uncharacterized protein LOC100870622 [Apis florea]
NCBI nr blastxgi|1571380549e-5424.56%hypothetical protein AaeL_AAEL003768 [Aedes aegypti]
Group
Gene OntologyGO:00001664.1e-12nucleotide binding
GO:00036762.4e-10nucleic acid binding
KEGG pathwaygga:4228152e-15 
 K07202 (PGC1)maps-> Huntington's disease
    Adipocytokine signaling pathway
    Insulin signaling pathway
InterPro domain[1371-1439] IPR0126774.1e-12Nucleotide-binding, alpha-beta plait
[1383-1439] IPR0005042.4e-10RNA recognition motif domain
Orthology groupMCL25776 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204068-TA
ATGGAGTCGCATATTTTGAATATGTATCACCAGGCCCCATATAGGAATATTGGTCATAACATACTCCGTTCGATAAGCGAGTCGTTATCATCCGAGGGCAGTTGTAACCAGAACAGCCCGGAACAGCAGGCTGATGAGAACGAAGTGTATTGGACAAGAAATACTCAGGTTTGGAGTCAAAATCAGAATGTAAATATAACACAGTCGAATCAAGATATAAGTGTAGATGTTGATGAAAATATTGAAGTACAGGAAGTGTCTATAGATGATAGGAACTCTGAAACCGATGGTCAGTTGGACACTGATAATATGGAGGAAAATTCGCTAGTGGAAGGTGATGATTATGATATCATCCAAAAGAAGATAATACATCAGATGAAGGGGAATACATCGATCCTGAAGATAAGATCCGATACGGAACCTAGTGTATCCCAAGACAGTCTAGAATTAAATTTCGATGCACCAGTCGTTTCAAATGTCGATGAATATTTTATAAAACAAAATGATGAGAAGCATATATTAGAGGTGAAAGACGAGCCGGTGGTCAAAGATGTGGATGAGTATTTCATCAAGGACACTAAAGATATGCCAACGATTCCACCACCGACAATCGTTGAGGAACTCCTAGTTAAAAGCAAATTACCTGAGACAGATTTCCGGATATCCAAAACTGTACCAGACGAGATATTCGAGTCCGAGCCGACCGTTGATGTTAAAGAAATAGAAACAGTCGAGCAACAAGAAGATTCAGTGGCAGATGTCTCCATCAACGAATCAGATTTAGAAGTGCTTCCTAATATAGAAGATCTAAAGCGATATCTCTTGGACGATTTGCCTTACACTAAACTGAAGAACGTACAGAAGTCTTACTCCGTCTCTCTACCACATTCGCCGATGCACAATATTTTGGATATAGATTCTAAAACGTGTTTAAGTTTCGAGGATCTTAATTTAGATCTATCTGATCTCACTTTCGAGAATGAGAAAGAGAAATCGAGTGCGAGCTTAAGGAGTGATGATATGCCACGGACGTTGACGGAGGAAGACGTGAACAGCTTTCTGATAACGAATCAAACAGAGTCCAAGGAAGTTGTGACCGTCGATGACTGTTGCCCCCAAGATATGGAAATCGATAGACCTCTGGGTGCTATTATCAGCAACGGAGACGTTCAAACCGTCATACTACCAGACAGAAAATGCACATCAACGCCCATCCCAAAACCCAATGTACTAGAGTTCTGCATAGAGAAGGTCGCTGTGAAAAAAGAACCAGATATAAAAGTCGAAACAGATGATTTTGTGGACGTCGAATCGTGCAACGACGCTGTCATACCCGTCCTGGAAGCGAATAATCTTAACTCCCTCCTAGAACAGTTTGAGGCAACCGAAAAATTGAACAAACGAAGAAAACTATCAGTGAATGTGAGTGATTCTAAAACAAAGACAATAAGCATAACTAGTGGCATGAGACTGCAAGACGCTGGTGTACAATTGAACAAGACGAAGATGCGACAAATATTGATGCCGTCGCCCCTTAACACTGTTATGAGACGTTCTCCAAGTCCGATCCACTCTGATCACGATTATTGTTCGTCCAAGAAACGGCTCAGCCTCCCGAATCTTAAGGGCGGTCAGAGTCTTCTCAAACCCGAGGTCCTGTCCAGCAATAATAAAATACTCAGCTCGAGACATAGGTCGTGTAAAAATAAAAAAGTTGTCTACCACCTCAGCAGCGACGACGAAAGCGATGCTAATACAGCAAAGAAAAATAAAATTCTAAATAATAATCGTGTAGCTGATGATGTTTTTGTTAAGAATAATAAAAAATCTAATATAAAACTAACTGTTAAGGCAGCGGCGAATTCGAATCATCGTAAGAAACAGTCGCCACCGCGTAGTGTGAATGATTGTGATGTTAACAAAGATAACGGCTCTGTGATGGTGAAAAATGCTTTTAAAGCAAGTGATACTTCTTGTAGTCAAAATTGTAACGGTAGCATTAAGTTAACGATAAAAAATAAATCCGAGGTTATTATCAAGAACTGTGATTTTAAGGACAATCGTAAGGACAACGTTGATAAAAATAAATTTGTAGGTGTAAGTGCTAATAGATTTTTGAACGATATAAATAATTCTAATAAAGGCATAGATACTTTAGATAGGAACAAAACGAAAGACTTAAATAGGGTAGAGAAACATTTTAATGTAATCGCAAAACAAGAAGTCAATTCTAAAGAACATTTCTACACTGCGCTGTTTAATGATAAACAGGATATTGAGCTTCCGCAAATAAAAGCCGAGAATAACGTCAAAGATGAGCAAACACAGGCCGAGAGCTTAAATGATTTGGAACAACCACAAAAGAAGAAAAAGCTGAATCTCCAAGAGTACAAATTAAGGCGAAATGTTAGTTCAAATGCTAGCTCAGCTCAAGTTAGCCCCGAAGCTATATTTCCCGATATCCCATGCAACATAAATCTTGATAAGAATTTAAGGGCAGTAAACAATCAAACAGCCAGTGACGTTGTTTCGGCACCAAAAGAGCCTTTAATTTCAGAAGCCCCAAAAACAATCTTCGATCCCATAAGAGAAGCTTCTAGAAAAATACTCATGAATTCCAAAAAGCAAAAGGCTGAAGCTATGAGGAAAAGAGATGAAGATATTGTTATGAGCAAAATACCTAAAGTGGAAAACTTAGAACTACAGCCCCTTATAAGTGATGCGGAAATGATGAAAATTGTTGGCATGACACCTAAGATACTCCCTGTGCCTATTGTACCGCCAACACAAACAGTCGTAGAAGATAAAGTTCAACTAAAAGATCATGACGAAATTGTACTTGTTAGCATCGGTACAAACACTGATGAGAATATGTTCAAACAGATAGACAAGGTTATTGAGAGCAAAAAACGGAAGTCCTCATCGCCCAAACACGAAAACAAAATGACGATCAACTTTAAGATCAAAAAATCTGATCCCGTGCTGAAACAAAACGTATTCGATACAGTTAAACGAAGCAAAAGTCCCATCAATGAGAAAAATCATTCGGAGGTTAAAATCGATAAAGAGAGGCTCAAAGATATTACAGCGACGTTAAAGAGTGTAGAAAAACAAGTGGACACGAAGATTTCTAGCAATTCTCTATTTGCTAGTATCCAAGACGTTGTGATGAAGAATGCTCCAACTGCTGATATTACTAAAGCTGAAAAGTCCCCGAAACATAGCTCAGTCGAAAAACGTGATGCACACCATAAATATAAAACGAGCATAGTGCGACAATACGACAATAGTGATGACCATGGCGAGGACAAAATAATCCTGCATTTAGAGAAGAATCGTAAAAAACCTGATCAAGCTAATGTCGAAGTTCAGACTGATTCTCCCTCTGAATCTGTCGTAATAAAAGATAAAGCAGATTTAAAAGAGAGCAGCCCTTCAACAAGGAAACGAAATGACAGTGACATGTCTATGTCGAGTGACGGTAGTCCTGTTCGTACAAAAAAACAGCACGTGCTGGCAACTAAAGACGAAAAATTATCTCCAACTAAACCAAGACAAGATAGACGCGATGTTCCAAGATCGCAGTCAAAAGAAAAGAGATGCAGGTCCACAGACCGTTATGACGTCAAATACCGACGTTCAAGATCACACTCACGGGGCCACAGAAGGAAAAGGTCTCATAGCCGCAACAGGTCGCGATCTCGGGGACGTTTCAGAAGATATAGAAGATCAGACTCCCCGTATAGAAGGAAAAGAAGATCGCGGACGAGATCGCCGTATCGATCAACAAGACGCTCTCCGTCCGTGCGAAAGGATTACCGCTCGACTCGCACCCGATCGAGGTCGAAACACGCAGAGAAGAAATCGAAAAGCCCAATGCCGAAAAAACGGATCAGTCCACAGAGAGCCAACGCTGAGAAGCCGTCCAGGTCCCTAACACCTCCACCAAGAAAACCGACCGTCTCAGAAAGCTCTGATTCCTCGACGTCTTCCAGTTCGACTTCATCGGGCGCTTCCTCGGCGTCGATCAAGTCGAGATATTCTTGCAGTCCGTACAAAAAAGATGAGAATTTCAGGAAAAACTACAGAAACTCATTTAGTTCTGAAGACAGAGAAAGCAATACTCCAGTAGAGGAGAGGCGGATCGTTTTCGTCGGCAGATTAGAGAAAGATCTGACGAAGGCGGCTCTGAGGGCTCAGTTCACCAAGTTTGGGCCGGTCACTGAAGTCAGGCTGCACTCCAAGGAAGACGGTTCTCGTTACGGTTTCGTAACGTTCCAACGACCTCGCGACGCGTGGTCCGCGGTAGAGGCCGCTTCTTCTTTCCCTCAATACGACGTGGGCTTCGGCGGCAGACGAGCGTTCTGCAGGCAGAGCTACGCTGACCTTGATGGTCTAGAGGCGAAGTACACGGAAAGCGCTTTCCACGGCCAGGCCGCGATGCCGGTCCGCCGGAACGAGGACATGTCGTTCGAGCAAATGTTGTTAGATATAAAAAAGAAATTAAATAAAAGGAAAGGCGACAAAGCCCGCCAAGACGATGCTTGA

Protein sequence:

>DPOGS204068-PA
MESHILNMYHQAPYRNIGHNILRSISESLSSEGSCNQNSPEQQADENEVYWTRNTQVWSQNQNVNITQSNQDISVDVDENIEVQEVSIDDRNSETDGQLDTDNMEENSLVEGDDYDIIQKKIIHQMKGNTSILKIRSDTEPSVSQDSLELNFDAPVVSNVDEYFIKQNDEKHILEVKDEPVVKDVDEYFIKDTKDMPTIPPPTIVEELLVKSKLPETDFRISKTVPDEIFESEPTVDVKEIETVEQQEDSVADVSINESDLEVLPNIEDLKRYLLDDLPYTKLKNVQKSYSVSLPHSPMHNILDIDSKTCLSFEDLNLDLSDLTFENEKEKSSASLRSDDMPRTLTEEDVNSFLITNQTESKEVVTVDDCCPQDMEIDRPLGAIISNGDVQTVILPDRKCTSTPIPKPNVLEFCIEKVAVKKEPDIKVETDDFVDVESCNDAVIPVLEANNLNSLLEQFEATEKLNKRRKLSVNVSDSKTKTISITSGMRLQDAGVQLNKTKMRQILMPSPLNTVMRRSPSPIHSDHDYCSSKKRLSLPNLKGGQSLLKPEVLSSNNKILSSRHRSCKNKKVVYHLSSDDESDANTAKKNKILNNNRVADDVFVKNNKKSNIKLTVKAAANSNHRKKQSPPRSVNDCDVNKDNGSVMVKNAFKASDTSCSQNCNGSIKLTIKNKSEVIIKNCDFKDNRKDNVDKNKFVGVSANRFLNDINNSNKGIDTLDRNKTKDLNRVEKHFNVIAKQEVNSKEHFYTALFNDKQDIELPQIKAENNVKDEQTQAESLNDLEQPQKKKKLNLQEYKLRRNVSSNASSAQVSPEAIFPDIPCNINLDKNLRAVNNQTASDVVSAPKEPLISEAPKTIFDPIREASRKILMNSKKQKAEAMRKRDEDIVMSKIPKVENLELQPLISDAEMMKIVGMTPKILPVPIVPPTQTVVEDKVQLKDHDEIVLVSIGTNTDENMFKQIDKVIESKKRKSSSPKHENKMTINFKIKKSDPVLKQNVFDTVKRSKSPINEKNHSEVKIDKERLKDITATLKSVEKQVDTKISSNSLFASIQDVVMKNAPTADITKAEKSPKHSSVEKRDAHHKYKTSIVRQYDNSDDHGEDKIILHLEKNRKKPDQANVEVQTDSPSESVVIKDKADLKESSPSTRKRNDSDMSMSSDGSPVRTKKQHVLATKDEKLSPTKPRQDRRDVPRSQSKEKRCRSTDRYDVKYRRSRSHSRGHRRKRSHSRNRSRSRGRFRRYRRSDSPYRRKRRSRTRSPYRSTRRSPSVRKDYRSTRTRSRSKHAEKKSKSPMPKKRISPQRANAEKPSRSLTPPPRKPTVSESSDSSTSSSSTSSGASSASIKSRYSCSPYKKDENFRKNYRNSFSSEDRESNTPVEERRIVFVGRLEKDLTKAALRAQFTKFGPVTEVRLHSKEDGSRYGFVTFQRPRDAWSAVEAASSFPQYDVGFGGRRAFCRQSYADLDGLEAKYTESAFHGQAAMPVRRNEDMSFEQMLLDIKKKLNKRKGDKARQDDA-