Monarch geneset OGS2.0

DPOGS207517
TranscriptDPOGS207517-TA3324 bp
ProteinDPOGS207517-PA1107 aa
Genomic positionDPSCF300177 - 72798-77466
RNAseq coverage92x (Rank: top 62%)
Annotation
HeliconiusHMEL0055200.080.18% 
BombyxBGIBMGA001939-TA0.056.71% 
Drosophila% 
EBI UniRef50UniRef50_UPI00020647B03e-7728.14%UPI00020647B0 related cluster n=3 Tax=unknown RepID=UPI00020647B0
NCBI RefSeqXP_001181608.12e-4928.75%PREDICTED: similar to XK-related protein 6 [Strongylocentrotus purpuratus]
NCBI nr blastpgi|3838473905e-8329.89%PREDICTED: uncharacterized protein LOC100883531 [Megachile rotundata]
NCBI nr blastxgi|3504126884e-8927.92%PREDICTED: hypothetical protein LOC100749324 [Bombus impatiens]
Group
KEGG pathway 
InterPro domain[28-347] IPR0186291.2e-62Transport protein XK
Orthology groupMCL26663 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207517-TA
ATGGCAGAAGCTTGTAGGCTTGGAACTTTAACTAACAAAAAGACTGCATGCACTCAAGATGAAAACTATTGTGTTTTTAATGTTTTGAGTGTATTTATATCACTTGTATCTATACTTGCGGATTTAACTACTGATGTTTTAGTATTTGCGGAATATTGTAAGGATGGTTATTTATATTGGGCATTCTCAACAATGATATTTATATTGTGCCCGAATATTTTAATTAATATTTTTTCACTAAGATGGTATATAATTGATACGCAAGTTAATCTTCGTAACTGGATAACACATTGCTGTTTGGTTGGATTGTTGGAGAGATATATTATATTTTTGCAGAAGGTATTCTCATGTAAAAATTTAGAGTCAATGAAAACACAGAAGTTAATAAACCAGAGAAATGATCTTAGTTTGTTGCATTTCCTGTACATATTCACGGGGACTGTACCACAAATTATATTACAGACATATTCTGTTGTTATGTTTAATGGAAATTATCATACAAAAGTGTTTGCTCTGGTAGCATCGATAGCATCTGTGATATGGGGTTGCGTGACATACATATACAGCATCACCACATTCACCACCGATAACTTCAAATGGAGTTCCATAGTTTTGAAACTGACATGGTATTTTGGAATTTTGATAGGGAGGATCGCCGGTATATTATTGTTCAGCATAATTTTCGGTATATGGGCACTTATACCATTAGGCTTACACTGGTCCGCGATGACGTTTTGGATAATCACACAGAAGACGACGTTCTGTCCAAATAAATTAGAAGAGACCTTTTACAATGCCGTGATGGGCTTCGTGTATTGTTTCTGTTACATCAATCTTAGAGAGGGTCACACTAGATATAGACTGTTGATGTTCTACACTCTGATAGTCACCCAAAATTTCGGCAGTTTGTTCCTATACATACTGATATCAGACTTGGAGATACAACGGAGGCCGTGGTCAATCGCTGCCACCGTCTGTGTGATAACTGGAACAGTTATCGGTATCGTGGCAATGATGTTGTACTACAGATTTTTTCATTCCAAAGGACCCATACCATGGAAGAATTCCCAAGACGACGTTGAATTAAACAATAAAACTGCGACGCAGAGTAACGATACAAAAAATAATAAAATTAAAGGCTTGAACAATGTCAGATCATTTAAAACTTATAACACAGCCGTGGAGGCAGAAAGGAACGATACGACAAGCGGCAACCAGCCACTAGCACAAGGCGATAGTCAAAATGCTGATAAGAAATTCCTATTAGACCATTGGATTGCAAGATCTGGTTCACCCACTTTCACATCAGCGCCGGTTGGGAACGCTAGCATGGATTCCAGTATCGAGCTGTACACGGTCGAGAACACTCCGCAGACGATCCTCAGAGAGAATACGAATAACACGACGAACATCAAACCCAAAAAGAAAATATGTTTTCCAACCGAGCTTAACCTGAAATTGTCGTCATTGGAAAACTTAGAAAGCTCCATCGATGAGCTGACGAAGACCACCCTGAACATACAGAAGCGACGTGGTATCTGCTCGTCTGACGAGTTGAGGGACGATTTGAACAATCTCAACCTACAAGCCTGTAATATTGTGAACGAGGCGTTCGCGAAATCACTGAAGAACGACCACAGTTCGGAGCTCAGTCATGACACAGACGGCTCCAACAAGAAAACCTCATCGGATTCCATCGATATGGATTTAGATCTAAGTTTCAGCGACATCAACAGCTCCGACATAAACACGTTCAACGAGAACATAGTGCAGAAGCTCTGTCTGAGCGCGCTCAAGAACATTAAAGTCGGCAACCAGGACGCGGATATCGAGAACATACAAAGGATAGCACTAGACATACTCAAAGAAATGTATACGAAGAAAGACAAGGTAAAAAACGGTATAACGACCGACCTGTTCCAGTCCAAGCGAGATCTCGATACGCCAACCGAGATCCTGTCCGTGCACGACTACGAGAACATCTGCGCCGTCAACATAGCGAGGGAAGCTTGGGGCTTACGGTCTTGGAATGGTTACTGCGATATCGAGAACTGGTTACACGACGGCAGCGTGGTTAGAGACAGAAGCAGAGACACGCTGACTTCAGGCAGCTCGGAAATAAGCAGCTGTTTATCGCAGGAACTAAAAAATGCTATACTATCAGCCCCATCATTCCCCAGGAAACCGCACACTTATTCAAATGTCTTCGTCAAATTTGAGAGAAGCAGCCAGGAGGATTACACCAACACTATGATGTGTCATTCAAACGATGACACCTTCTTAGCTAAACCATGTATCATAGATCTGACTAAAGACCCCAATCATTTGGAACCGATACTAGAAGAACTGGACGATCTGGGCGATATTGATGTTTCGTGTAAGAAAAGACTGAATTCTGAGAGCTCTCTTGTGGCTACTATAGACGAAATAAGAAAGGCCACTACAACTAATTCCCCTAGAAATCCATATTATAGGAGGGAACAATGGGACAGTCAGTGGGATTCACCTCAATTCCATTCAGACACCACGGAGATTAGTTTAAAGAAAGCCATTTGGAAAGAACCGTATAAATTTAATTCAAGCAATAAAATGGACTCATTACAGGAGAAAGACATCGCCATTGGTTCGTCTCACAGCTCTAAATTTGATTCCAGCAAGGATTTGAACGTCCTGCCGCTATCTGCTAAGCTTAACGTTAATACCGCCCAAAACAAAAACATTCCACGGAGGCAGAAGATGAATTCTACTAAAAATCAAAATAAGAAACTTAACAATAAGTCGGAACAAGTCCACGACCTTTCTAATAAAGATATAGACTTCTTGTGGCAGTTGGTATCGGAGAATTCTCTGCCTATAAACACGGTCGATGAGTTTACGTTAAGTCGCAAGACTCCGTCATTACAATCTCTTATACAAAAAGACACTACAACGAATTTAGTCAATGCAACCACCGCTAAAATATCAAATATTAAGAAGAACGGCAGAACGAGACCGAGGAGGAAATTCTCCATATTACGAGAGAGGTTCGAACCGAAGACAAACTTACACTCCGATGAAGAAATCTTATATCAATGCTCGGAAAATCTATCCCAAAACTTTGACTCCAACATAATGAACGTGAGGAACCAGTTCGACGCTCACAGCGATAGTTTAAATATAAAACCACCTCTGAAACAATCAAACGAACTATCGAATGATGTTAAATGTAATTCGATACTAGAATCTCAAACTTCTTCGAAAAATCTCAAAGATAAACGATCTATATTCATGAAGCAAGTACTAAGCCCGCCGAGGTTTAATACCAAGCTCAAACATAAGGTCTAA

Protein sequence:

>DPOGS207517-PA
MAEACRLGTLTNKKTACTQDENYCVFNVLSVFISLVSILADLTTDVLVFAEYCKDGYLYWAFSTMIFILCPNILINIFSLRWYIIDTQVNLRNWITHCCLVGLLERYIIFLQKVFSCKNLESMKTQKLINQRNDLSLLHFLYIFTGTVPQIILQTYSVVMFNGNYHTKVFALVASIASVIWGCVTYIYSITTFTTDNFKWSSIVLKLTWYFGILIGRIAGILLFSIIFGIWALIPLGLHWSAMTFWIITQKTTFCPNKLEETFYNAVMGFVYCFCYINLREGHTRYRLLMFYTLIVTQNFGSLFLYILISDLEIQRRPWSIAATVCVITGTVIGIVAMMLYYRFFHSKGPIPWKNSQDDVELNNKTATQSNDTKNNKIKGLNNVRSFKTYNTAVEAERNDTTSGNQPLAQGDSQNADKKFLLDHWIARSGSPTFTSAPVGNASMDSSIELYTVENTPQTILRENTNNTTNIKPKKKICFPTELNLKLSSLENLESSIDELTKTTLNIQKRRGICSSDELRDDLNNLNLQACNIVNEAFAKSLKNDHSSELSHDTDGSNKKTSSDSIDMDLDLSFSDINSSDINTFNENIVQKLCLSALKNIKVGNQDADIENIQRIALDILKEMYTKKDKVKNGITTDLFQSKRDLDTPTEILSVHDYENICAVNIAREAWGLRSWNGYCDIENWLHDGSVVRDRSRDTLTSGSSEISSCLSQELKNAILSAPSFPRKPHTYSNVFVKFERSSQEDYTNTMMCHSNDDTFLAKPCIIDLTKDPNHLEPILEELDDLGDIDVSCKKRLNSESSLVATIDEIRKATTTNSPRNPYYRREQWDSQWDSPQFHSDTTEISLKKAIWKEPYKFNSSNKMDSLQEKDIAIGSSHSSKFDSSKDLNVLPLSAKLNVNTAQNKNIPRRQKMNSTKNQNKKLNNKSEQVHDLSNKDIDFLWQLVSENSLPINTVDEFTLSRKTPSLQSLIQKDTTTNLVNATTAKISNIKKNGRTRPRRKFSILRERFEPKTNLHSDEEILYQCSENLSQNFDSNIMNVRNQFDAHSDSLNIKPPLKQSNELSNDVKCNSILESQTSSKNLKDKRSIFMKQVLSPPRFNTKLKHKV-