Monarch geneset OGS2.0

DPOGS214770
Transcript: DPOGS214770-TA (3360 bp)
Protein: DPOGS214770-PA (1119 aa)
Genomic position: DPSCF300022 + 1464989-1480084
RNAseq coverage: 363x (Rank: top 33%)
Annotation
Heliconius      HMEL013590        0.0     59.48%
Bombyx          BGIBMGA000174-TA  0.0     69.26%
Drosophila      orb2-PG           8e-70   27.52%
EBI UniRef50    UniRef50_E1JI83   3e-67   27.64%  CG5741, isoform E n=13 Tax=melanogaster subgroup RepID=E1JI83_DROME
NCBI RefSeq     XP_001956859.1    2e-75   27.78%  GF10141 [Drosophila ananassae]
NCBI nr blastp  gi|194748859      5e-74   27.78%  GF10141 [Drosophila ananassae]
NCBI nr blastx  gi|332025512      3e-103  28.48%  Testis-expressed sequence 2 protein [Acromyrmex echinatior]
Group
KEGG pathway 
InterPro domain: [814-903]  IPR019411  1.7e-16  Domain of unknown function DUF2404
Orthology group: MCL26356 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214770-TA
ATGGAAGTTTCAGGCGCAGGGAAATCCCCAAATACGTCACTATCATTTAGATACAATGCTAATAATGAGGAACTGGAGGAACTTCTTCAAGCCTGTGAGGACGACCCCCCAACACCACAAGCTGAGCCAGCTCCGGTTAGGAGTGAAAGTGCCAGTCCTAAAAGAGCCGAGAAGAATATATCCATCATAGATAAATATTTCAAATCTATGCGTACAGAGAAAACAAATGAAAAGACAGAGGATATCAAACCAGACATCAAACCGGAAGAAACTCGTTCTATAATACCTCCCAAAGAAATCACTTCATCCCCTATGAAAGAGTATCTAAACCGGCTTGGAAAAAGAAACACCCCAGAGGTCGCTCCGGAGGGAAAGGAGGCTAATGAAACTTGGAAAATATTTCATGATTTTAAGTTCAAAATAGCTCAAGCTGTTGAAGATATGAAAACGCGTTCTGTTGAAGACACTAAGGACAAAAGTCTACCGAGAGAAAACTCGACATCAGATTCTGAAGAGAATTCAGCTGTTAAGGACTCGGATCAGCAGAGTATGGGTGATACGGACAATCAAGTGTCTTCATTGGACAGCAGTATGCAGAACTTATCTGACATGACTCAGCCATTAACGGCCAAGCCTATGGAGTTTGAGCTGATAAAGGAAACGAATTTAAACAAAAGTCACTCGGAATCCAGTGATGATACGCATAAGAATTTACCATATCAAGAATCGGACTTAGCGAGGGAATTCTTACAAGACAGTATAGAGATCGAGTCCGGTGTTGAAGCCCTCGAGGACACGATCGACGCTTTCGGCGACGCCAATCAGATAGAAATGGAAGACAATAAAAGAACAACAGAAACACCCACAAACAATGCGAACACTTCAAACTTCGGACAGCCGCCCAACAAGAACAACATACACAAAGACGAACCAAAGAAGAATTATTTCTTTAAATTCTCAATGTATTTCCTAACGATATTTCTATTCGTTAACTACGTGCTATTCCCTAATTCGAATATATGGAACGGTTTCCTATTGGGCATTTGGTTTTTCTGTTTCGCAACCAACGTCCGCGATTGGCTGATGGATAACTACTTCAGGGATTGGGAACCGAGGAAGGGTGGTTTCTTCCAATTGAAACAGAGCACAACACTACCATTTATATACACGATACCAACTATAAAGGAGCACAGGCCGTTGAAAAAGTTTGAGGGTTGGTTAAATCATTATAGGTTTCCAGACTACGATCCCTTCACGTATCACATCAATAAGACAACAACGGCTTTCGTCAAACTTGAAGGTTGCAACCTACGCATATCGTACACAAGGACAAAGGTTGCTAAGCGCGCTCTTTGGGACGAAAAGATTGAAAATGTAACTTTCTACCAGCACCGGCTATACAACCTCACTGGTGCGAGAGTCATATTGCTGCCGAAAGGTTTAGTTAAAAGGAGGCAATGGAGCAAGAAATATCCAATTTGCATAATATTGAACGAAAAGGAGAAGATACAGGTGTTAGAGAAAGAAAACACGGATAAGAAATCTGATACACAGGAGAAGAAAAATAAAGATGTACCAGCGGAAGTTGAAAGCGCTGAAACAAACACGACGCCCGAAAAGAAGAGGAAGTTCGTGTGGAGAAAAAAGGAGAAGAGACCCCCGGCCTACACGTCATGTCACACTCAGCTAAACCCCTTCCTTCCTAGCCCCTACAAAAAGATCGACAACACCTCACAGTGCACCGAACCCGAGACGAGTCATTCAGATAAGAACACACAACTTGAATATGAAGCGGCGGAAACGGAAATAGAACTGACCAAGACGGCCAACAGCAAAGATGAGGACGTGGAGCTCGATGAGAGCGAGCTAACCAAGATCAAAGAGTGTCTAGAGGAAACTGAGTTGGAGACGGGAGCAGATGGTGCGGCCGAGGGGGAGTGGAGCGTCCACGTCAAACACTCAAGAGATAAGCATTCTAGATTGTATCTCTTCGCAAGGAC
CGGAAGAGATAAACTCGAATGGTACCGCCGCCTCCTAGTAGCCGTGTCAGAGGCCCGTTCCGACTCTCCCGTTGAGGACGAACGACACACGGACGACAAGGACCCCATCGAGCTCGCTGTATACAAGCTCACTGAGAAGGATACTGTCGCCTTCGCTAAGTCCAAGTCCAACGAGCAGGTGTCTGAGGGGGTGACTATCAGCAGTACGGCGCAGCCGCTGCCGTCTAACTTTGACCTATACGAGAAGTCGTTTTGGCCGTACCTACTGAAGATTATACAGAACCACGAGACGTCCTCTAAACAGACGACGGACGCGGGCGTCATGTGTCAGATAGAGCCCACGCCGCCCGACAGGAGTAAGAGTAAAAAGAAGAAAAAGACAGCTTCCCAGGGCGCGGAGTGCACGTGTCGCGTGTTGCCGGCGGAGGTGTCGTGGGTGAACACCGTGCTGGCGAGGCTCATGTATGACGTCATGAGGGACCCCGCCATGGTGGCCCGCGTCCAGAACAGGATACAGAGGAAGCTTAACACGCTCAAGCTGCCGTCGTTCATGTCTCCGCTGGTGGTGACGGAGCTGGTGCTGGCGGGCTCGTGCCCAGCCGTGTGCGGGGTGGGCTCGCCCTCGCTGGACGCGCGCGGCCTGTGGCTGCACGCCCTGCTGCGCTACGACGGCGGCGCCACCATCACCATCCTCACACAGATCAACCTGCTCAAGCTTAAGGAGAAGAATCTCACCTTAGAAGATCAACTACTAGCGGCAGCCGAAAATACAGTCGAGAGCGATGCTAGCTGCTCTATACCGAGCACGCTGCTCACAGACAAGAAACGCAAACCGGCGATCTACGACTCGGAGGTGGAGGATTCGGCGGAGTCCAGCAGCGACGACGAGAGCCCGCCCGTGCAGCCCGTCGACAGCACAGAGAATGTATTGGCCGCGGACTCTGTATCGTCAACAAACGAAGGCGGCTCGTCCAAGAAGAAGTTCCTCCGTATGGTTGACAAGATAGCCACCAACAAATACTTCCAGCAGGTAACCGACTACAAGTACGTGAAGCGGGCCATGGAAGGTCTCAGCAACACGGACATCAAGCTGCAGCTGGAGGTGAACGGCCTGGAGGGGAGGCTCGCCATCAACCTGCCGCCGCCGCCGCACGACCGACTGTGGATAGGGTTCTCCACCAACCCGCAGCTGGTGCTGAAGGCGCGGCCGGCGGTCGGCGCGCGGGCGCTTCGGTTCGCTCACATCTCCAACTGGATAGAGCAGAAGCTCACCAAGGAGTTCGAGAAGGTCCTCGTGCTGCCCAACATGGAGGACTTCATCATAGACGTCATGTCACCCACGCCTATAGAGTTCGAGTAG

Protein sequence:

>DPOGS214770-PA
MEVSGAGKSPNTSLSFRYNANNEELEELLQACEDDPPTPQAEPAPVRSESASPKRAEKNISIIDKYFKSMRTEKTNEKTEDIKPDIKPEETRSIIPPKEITSSPMKEYLNRLGKRNTPEVAPEGKEANETWKIFHDFKFKIAQAVEDMKTRSVEDTKDKSLPRENSTSDSEENSAVKDSDQQSMGDTDNQVSSLDSSMQNLSDMTQPLTAKPMEFELIKETNLNKSHSESSDDTHKNLPYQESDLAREFLQDSIEIESGVEALEDTIDAFGDANQIEMEDNKRTTETPTNNANTSNFGQPPNKNNIHKDEPKKNYFFKFSMYFLTIFLFVNYVLFPNSNIWNGFLLGIWFFCFATNVRDWLMDNYFRDWEPRKGGFFQLKQSTTLPFIYTIPTIKEHRPLKKFEGWLNHYRFPDYDPFTYHINKTTTAFVKLEGCNLRISYTRTKVAKRALWDEKIENVTFYQHRLYNLTGARVILLPKGLVKRRQWSKKYPICIILNEKEKIQVLEKENTDKKSDTQEKKNKDVPAEVESAETNTTPEKKRKFVWRKKEKRPPAYTSCHTQLNPFLPSPYKKIDNTSQCTEPETSHSDKNTQLEYEAAETEIELTKTANSKDEDVELDESELTKIKECLEETELETGADGAAEGEWSVHVKHSRDKHSRLYLFARTGRDKLEWYRRLLVAVSEARSDSPVEDERHTDDKDPIELAVYKLTEKDTVAFAKSKSNEQVSEGVTISSTAQPLPSNFDLYEKSFWPYLLKIIQNHETSSKQTTDAGVMCQIEPTPPDRSKSKKKKKTASQGAECTCRVLPAEVSWVNTVLARLMYDVMRDPAMVARVQNRIQRKLNTLKLPSFMSPLVVTELVLAGSCPAVCGVGSPSLDARGLWLHALLRYDGGATITILTQINLLKLKEKNLTLEDQLLAAAENTVESDASCSIPSTLLTDKKRKPAIYDSEVEDSAESSSDDESPPVQPVDSTENVLAADSVSSTNEGGSSKKKFLRMVDKIATNKYFQQVTDYKYVKRAMEGLSNTDIKLQLEVNGLEGRLAINLPPPPHDRLWIGFSTNPQLVLKARPAVGARALRFAHISNWIEQKLTKEFEKVLVLPNMEDFIIDVMSPTPIEFE-
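
The transcript and protein lengths listed above are consistent with each other: 3360 bp is 1120 codons, that is 1119 amino acids plus one stop codon, which the protein record shows as a trailing "-". The sketch below is one way to spot-check that relationship and the first and last codons of the record. It assumes the standard genetic code; the `translate` helper and its `stop_symbol` parameter are illustrative names, not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as `stop_symbol` to match the "-" that
    ends the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(protein).replace("*", stop_symbol)


# Lengths taken from the record: 3360 bp transcript, 1119 aa protein.
transcript_bp, protein_aa = 3360, 1119
print(transcript_bp == (protein_aa + 1) * 3)   # True

# First seven and last five codons copied from DPOGS214770-TA.
print(translate("ATGGAAGTTTCAGGCGCAGGG"))      # MEVSGAG
print(translate("ATAGAGTTCGAGTAG"))            # IEFE-
```

To check the whole record, pass the full DPOGS214770-TA sequence (with line breaks removed) to `translate` and compare the result with DPOGS214770-PA.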