Monarch geneset OGS2.0

DPOGS211173
TranscriptDPOGS211173-TA4425 bp
ProteinDPOGS211173-PA1474 aa
Genomic positionDPSCF300007 + 332230-340864
RNAseq coverage158x (Rank: top 52%)
Annotation
HeliconiusHMEL0172310.050.18% 
BombyxBGIBMGA003159-TA6e-14941.80% 
Drosophilal(2)k14505-PA2e-5948.23% 
EBI UniRef50UniRef50_UPI0002063A406e-8840.75%UPI0002063A40 related cluster n=1 Tax=unknown RepID=UPI0002063A40
NCBI RefSeqXP_393942.21e-8840.75%PREDICTED: similar to ubiquitin conjugating enzyme 7 interacting protein 3 isoform 2 [Apis mellifera]
NCBI nr blastpgi|3838575332e-8738.90%PREDICTED: uncharacterized protein LOC100878261 [Megachile rotundata]
NCBI nr blastxgi|3287795892e-9025.87%PREDICTED: hypothetical protein LOC410462 [Apis mellifera]
Group
Gene OntologyGO:00434618.2e-36proton-transporting ATP synthase complex assembly
KEGG pathway 
InterPro domain[86-252] IPR0233351.1e-38ATPase assembly, ATP12, domain
[28-148] IPR0114198.2e-36ATP12, ATPase F1F0-assembly protein
Orthology groupMCL26275 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211173-TA
ATGTTCAGCAACAAAATTGTGAAACAGTTTATCGATAAACAATACCTGACATGGTCTAGATATCAGAAATATGCAACTCACAAACGTTTCTACAGGCGAACAGATATCGTTCAAAATGAAAAGCATTGGGAAGTAACTCTAGACCATCGGCGCCTAAAAACTCCAAATGGTCGCGTCTTTACCGTAAACACAGAACCACTTGCACGGGCAGTAGCTGTTGAATGGGACGCACAACACGAATGTATTACACGGCCTACTATGCATCTCACTGCACTGTGTAATACAGCTATTGACAATCCAGGGAAGTTAAATTGCCATGATATTACAAGTTACTTGTTAGACTTCATTGCAACCGACACATTACTTTTTTATTCAGAGGAAGAAGAATTAAGGAAATTGCAAGAAAAAAAATGGGAACCAGTACTGGAATGGTTTTGTAAGAGATTTGGAGTAACACAAGAGGTGTCTAAGGATCTGGAGCTTCCACCCATACGAGCTGAGACCAGAGCAGTACTAGCTCGACACTTCCTATCCTATGACTTTCCTTCCTTAACAGCATTAAACTTTGGTGTTGAAGCCTTGAAGTCACCAATTCTTATGCTGGCTTGTGTTGAAAGGCACTTGGAACCCAAAGATGCGGTGATGCTTGCTAGATTAGAAGAGGAATACCAGGTGTCTCGATGGGGCCGCGTGCCGTGGGCTCACGAGCTGAACCAGGCTGAGCTGACTGCGCGAGTCTCCGCCTCATTGGTGATGAACGACGAAATTGAGGAGGAGGCACCTCGCCCCCGTCCCCTTTCCGGTGGCATTTGGACCCTGTTCTCATGGCTACGACGCACTGATAGGTCATTCTCAAGTGAAAGTATCAGCAGCGTTGGTTCCGATAGAACTGTTGCCAGTTTCGACTTCCTAGCTCCAATACACTATAAAGATAACCCAGGTATATTGTTACCTCAGATTCCAGAGACGGAAACATATAAGAAAAGGTTGAATGAACGAAACTTGCGAAGGAAATACGACCGTGATATAACTTTGCGACGTAAGTACGGCTTATTCAGAGAAGAGGTCACCAGTTTCGACGGCTTCAGCTTACCTTCTAAAAGACTACAAAGAGTTTCACCAACTGGAAGGGATAGACGAGCCACCAGTGAAATCTTCCATCGCAGAGCTCCGTACGTTCCTGGTAAGCGACGTGCCCCATTACCACCACAAACCGTAGTTTCAAACACTTTACCTCGTAGTTATAAAAGAAAAAGGCAAGCACCTAGGCCGCCCGTAAAATTTTCCGACGAAAATATGGAAAGCAACAATATCGATAAGAATATGTTAAGAGAATCTAAATCCACGAACACTAAATATATAGTCACAAGTCACCCTGAATCAGCAATGCAACCAGAAAATGATTTGAAAATTCCACAACCTAAGGAGGTCAAGTTACGCAGCGAGAGGAGTTTTCTAAAACAAATTTTTGATAACCGGAAAAGGAACTCGGCGATTGATACAAGCCATGTGAAACTACTGCCAAGTATAAGTGAACTTGATAAACAAGCAGCAGAGATAATAGCTACAAATAAGTTTCTTCATGGCGAGAGCAGTGGGCATAAGGATGATAACTTTTCTAAATTGCCGATCCCAACATCGTCCAATGAGAAGAAATGGATATGTAAATTGTGTTATAGAAAATATGACGCCTCCATTGTTTCGTGCGTATACTGTATTACAAGTAATAAAAATAAAAAAATAGCTTCCAATATATACACACAGACAGATGTTAAAGCGATGTCTAGCAAAGACAGCGAAATTGACGAGAAGAAGAAGTTAAAGGAAATGTTAAAAGAAATGAAAGATTCCTTGCCCAAGAGACCAAAGCATAACGTTAATAATGAAAAAACTAATATTTCTTCAACAGAAACACCTACCCTACGAATTGGGTCGGCCATAGAAAATATTGGTGTAAACGCTGAACAGTCAACAAATTCCAAGTTTCACAATCAAGACAAAAAAGTCATAGAAAGTAGAACTTGCAATAAGATTTTGAATTTACCGTCAGAACCATCTACCTCCGATCGACACACTGACATTTTTGTTGGTAACAGTTCTAAAGTCACAAATCAATTCAGGAATCCTGATGGAAGTATTAATCACGGTTCTCATTGTGACGATGTAAAAATGGGAATATCAAATTCTAGAAAATATAATTTTCTTCTGGGCGCCGATCCAAAGCCAACAAATAATTTAATCAACAAACAATCATCCGATGGTTTGTTGGCTGAGACGAATAAAAATAGTGTAATAATTGACAAAAAAACAATGGTTTACAATAATGAAGATAAGACTAAAACAACAGTAAGTCAACAACCACAAAGTTCAGGGTTACAAATCATTAAAATAAATAAAAAGTCAGACGTGCCACTCAAAGGAAACCGGACTGATGTAGAGATTGTTAACACTGATAATAAGTCGAAATCAACTAAAATTTTAAAAATCAATCAATCTTCTTCAAACGTACAATCTAAAAGCTGTATCAAAGACGAACCTAAAAGTCGCAATGCAACAACGGAACAGAGCACAGAACAGTCGAATAAAAAAACAATTATATTAAAGGAAACAAAAATGGAAAGTGTCCGACTTGGTGATGTCATATCATCAGAAAACATCCGTTTAAACGATACCAACAAAACCGTGCCTGATAGTAAATTGAACACGCCACTTAGGATATCATCATTATTAAATCCTTTATACTGTCCAAAAACGGAGGTTTCAAATAATAAAATTGAAACCGAGATCTTACAGACAAAACCTGATAGTAGTGGTAAAAACGGTGACGTAAATAATACTATCTTATCATCTAATAGTTCTGCTGCAAGTGCTTCCAAGAAATCCAATGAAGTCCATGGTGCTAGCACTTCTAAGCAACTTGAAAATGATTTGCCGTCTAATAATCAACTAAACACTGAAACTTCGGCTACAAAAACCAACTCCAATATAGATTATTTGAATCTTCATGCAAAACGTAGGGACCTAATAAATCAATTAGAATCAGCCATTTCTAAAGGTGACGAAAAAACCGCAGCTGATGCGGCGGCGAATCTCGCTAAGCTAAAATTGTCTTGCTCTGTTTTATCATTTTCTTCTCAAATAGTTGGATTAGTCAAAAATACACCAAATGACGCTAACAAAATTGAAAAAGTTGCTCAAGATGGCAACATTAACGCAGTAAAAATAAACGGCTCTAAGGAAGTTGAAAAACAACACCGTGAACAATTGACAATTACTGGATCGGCAGTTCTAAACAATTTCCAACCGTCAACATCTAGGGGAGACTCAGTCGAAGATATGGTTCAGATTGAAATATGGGTGGAAGATAAAGAGGCCGCTAGGGGTCCTATACGTATGAAAGTTAAAAGGAAAGCCGTGATGGGGGATTTGAAGCGACAAGCTGACGCAGTACTCGGTCTTGAAATACGTTTACAGCGGTGGATTATTGGCAAAAACCTGTGCACTAACGACGATACACCCCTCCTCACCCTGGCAGGTCCTGATTTCAAGGCGCCTTTCTATTTGTGTTTAGTTGAACCTGGTACTAATAAGGATGATACCAGTAAAGTCAGTGAAATTTCTAATGACAATAATAAAAATGTGCCAAATAAAATAAACAATAAGGAAACGGGTGACGTTTACTCGGAGTTAATAAAATTGGAACAGCAAGCACTCGTGCCAAATACCGAAGACTTTGAATGTGGCGTGTGTATAGAGCAATGTCCGATGGGAAACGGTGCGGTCCTACGGGAATGCATACATACATTCTGCAGGGAATGTCTGTCGGATGTCGTACGACACTCCCAAGAGGCGACCGTTTCATGTCCAGCTATAGGCTGTCCTGGGACGTTGCAAGAAAGAGAAATACGCGCCCTCCTAACTCCCGAGGAATATGACCGTTGGTTGGCGAGGAGTCTCAGCACTGTGGAAAGTGGAACTCGCAATACGTTTCACTGCCGCACTCGCGATTGCACCGGCTGGGCGTTTTGCGAGCCTGGTGTTAGGAGATTTCCCTGTCCTGTTTGTAAACATGTGAACTGTCTCCCTTGTAAGGCTGTTCACGAAAACGAAACATGCGAGACATATCAAGCAAGATTAAGTAGGGCTGCTACAGTGACGGATTCGAACCAAACAGACGAGGGGACGCGGGCTCTGCTAGACTCACTGATTGCGAAAGGCGAAGCACTGGAATGTCCTGAATGCAGCGCAATTATAACCAAGAAATGGGGATGTGACTGGATTAAATGCTCATCATGCAAGACGGAGATATGTTGGGTGACGAAGGGTCGCCGTTGGGGACCGGCTGGTAAAGGTGATACAAGCGACGGTTGCAAATGTGGTGTAAACGGTAAAAGGTGCCATCCTTTGTGCGGCTACTGTCATTAG

Protein sequence:

>DPOGS211173-PA
MFSNKIVKQFIDKQYLTWSRYQKYATHKRFYRRTDIVQNEKHWEVTLDHRRLKTPNGRVFTVNTEPLARAVAVEWDAQHECITRPTMHLTALCNTAIDNPGKLNCHDITSYLLDFIATDTLLFYSEEEELRKLQEKKWEPVLEWFCKRFGVTQEVSKDLELPPIRAETRAVLARHFLSYDFPSLTALNFGVEALKSPILMLACVERHLEPKDAVMLARLEEEYQVSRWGRVPWAHELNQAELTARVSASLVMNDEIEEEAPRPRPLSGGIWTLFSWLRRTDRSFSSESISSVGSDRTVASFDFLAPIHYKDNPGILLPQIPETETYKKRLNERNLRRKYDRDITLRRKYGLFREEVTSFDGFSLPSKRLQRVSPTGRDRRATSEIFHRRAPYVPGKRRAPLPPQTVVSNTLPRSYKRKRQAPRPPVKFSDENMESNNIDKNMLRESKSTNTKYIVTSHPESAMQPENDLKIPQPKEVKLRSERSFLKQIFDNRKRNSAIDTSHVKLLPSISELDKQAAEIIATNKFLHGESSGHKDDNFSKLPIPTSSNEKKWICKLCYRKYDASIVSCVYCITSNKNKKIASNIYTQTDVKAMSSKDSEIDEKKKLKEMLKEMKDSLPKRPKHNVNNEKTNISSTETPTLRIGSAIENIGVNAEQSTNSKFHNQDKKVIESRTCNKILNLPSEPSTSDRHTDIFVGNSSKVTNQFRNPDGSINHGSHCDDVKMGISNSRKYNFLLGADPKPTNNLINKQSSDGLLAETNKNSVIIDKKTMVYNNEDKTKTTVSQQPQSSGLQIIKINKKSDVPLKGNRTDVEIVNTDNKSKSTKILKINQSSSNVQSKSCIKDEPKSRNATTEQSTEQSNKKTIILKETKMESVRLGDVISSENIRLNDTNKTVPDSKLNTPLRISSLLNPLYCPKTEVSNNKIETEILQTKPDSSGKNGDVNNTILSSNSSAASASKKSNEVHGASTSKQLENDLPSNNQLNTETSATKTNSNIDYLNLHAKRRDLINQLESAISKGDEKTAADAAANLAKLKLSCSVLSFSSQIVGLVKNTPNDANKIEKVAQDGNINAVKINGSKEVEKQHREQLTITGSAVLNNFQPSTSRGDSVEDMVQIEIWVEDKEAARGPIRMKVKRKAVMGDLKRQADAVLGLEIRLQRWIIGKNLCTNDDTPLLTLAGPDFKAPFYLCLVEPGTNKDDTSKVSEISNDNNKNVPNKINNKETGDVYSELIKLEQQALVPNTEDFECGVCIEQCPMGNGAVLRECIHTFCRECLSDVVRHSQEATVSCPAIGCPGTLQEREIRALLTPEEYDRWLARSLSTVESGTRNTFHCRTRDCTGWAFCEPGVRRFPCPVCKHVNCLPCKAVHENETCETYQARLSRAATVTDSNQTDEGTRALLDSLIAKGEALECPECSAIITKKWGCDWIKCSSCKTEICWVTKGRRWGPAGKGDTSDGCKCGVNGKRCHPLCGYCH-