Monarch geneset OGS2.0

DPOGS203162
TranscriptDPOGS203162-TA3495 bp
ProteinDPOGS203162-PA1164 aa
Genomic positionDPSCF300035 - 788785-797732
RNAseq coverage343x (Rank: top 34%)
Annotation
HeliconiusHMEL0109710.079.31% 
BombyxBGIBMGA011009-TA0.070.65% 
DrosophilaCG1104-PC0.046.48% 
EBI UniRef50UniRef50_D6WAW00.048.54%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WAW0_TRICA
NCBI RefSeqXP_969461.10.048.54%PREDICTED: similar to Uncharacterized protein KIAA0776 [Tribolium castaneum]
NCBI nr blastpgi|910777920.048.54%PREDICTED: similar to Uncharacterized protein KIAA0776 [Tribolium castaneum]
NCBI nr blastxgi|910777920.048.68%PREDICTED: similar to Uncharacterized protein KIAA0776 [Tribolium castaneum]
Group
Gene OntologyGO:00550853.2e-20transmembrane transport
GO:00160213.2e-20integral to membrane
KEGG pathway 
InterPro domain[9-282] IPR0186114.9e-96E3 UFM1-protein ligase 1
[648-1164] IPR0161961.4e-45Major facilitator superfamily domain, general substrate transporter
[791-1044] IPR0117013.2e-20Major facilitator superfamily
Orthology groupMCL13148 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203162-TA
ATGGCTCCTTCAACTGATTGGGATGAAATAAAACGTTTAGCAGCAGATTTTCAAAAAGCTCAATTGAGCACAACAGCTCAAAGACTTTCAGAGCGAAATTGTATTGAAATTGTGTCCAAGCTAATAGAATTAAAGTTAATAGATGTTATTTTCACAATTGATGGGAAAGAGTATTTGACACCACAGCAATTAATAAGGGAAATAAAAGATGAATTATATGTTCGCGGGGGTAGAGTTAATACAGTAGATCTTGCTAAAGAGTTGAATGTTGATTTGAACCAGATAAACTTAAATGTGACTGAAATAATCAAAGGCAAAGAAGTGCAATTAGTTTCAGGATCTTTAATTGCTCACTATTATTTGGAAAAAATAGCCAGAGAGATAAATGAGAAGCTACAATTACAGGGTCAAATTACTGTTGGTGATTTGACATTGCAGTATGACTTGCCAGCAGATCTCTTACAGCATGGCATATTAGAAAAGTATTTGGGTAAAATCATTAATGGGAGACAAGACCCCTCTGATCCAAGAGTGTTCTATACAGAAGAATATATAACAAGGACTAAAGCTAAGATACGAGGAGCTTTAATGGGCCTCTTAAAACCCACACCTATAAATTTAATTTTAAGCCATTGTAATTTAACAGAAAGGCTATTTATGTATTTGTTTGATCAATTAAATGCTCCTGGAGTACTGACTGGAAGACAGTCAGGGGCGCTTTATGTTCCATCTTGTTATACAAAATCTCAAAATGATTGGGTTATAAATTTTTTCAAACAAAACAATTACTTGGAGTATGATGCTTTGACGCGCTTGGGTATTTCTGATCCAAAGGGTTATGTTAAGAGAGTTCTATCAAATGAGAATATCACCTTTTTAAGCAGCTGCATAATCGGGTCTCAAATTAAACAGCAGTTGGAAACTGCTTTAGAGGAATGCATTGCCTCAAAGAGTTATCTTGATGTAGTATCACTATTACCATCTGTATTGTCAGATGAAGATATTGAAAATGTGCTTGATGCCCTTCTGAAAACTAACAGCTCCACAATTTTATTTGATAAAACAGTTTTCAGCAATCAGTATATTGAAAACCTTAAACAGGCCTGTTTGCCATTAGCCCAAAAAAATGCAGAGACTGTGGTTAAATCTGGTAAATACCAACAGTTTTATTTGGAAAAACAACTAGTAAAAAATGAGGCACAGCAGAGCCATGTGGATCATAAAGCTGAAAGACGCGAAGAACGTAGGAAGAAGGCCAGTTCAGGGAAAGGGGGTGGTGGTACTCAGGGCAGAGAAACCAAAACTAAGGCAGTGAAAAAACATCCAAGATCCAAGCAAGTAGTACAAGATTCTGATTCTGATGAGGCACCAAGTGTAAAGAAGACACCAAGTCAACTTGAAATAGTAAAAGTTGAAGACATTGAGAATATAATTAAGGAACCACTTGAAAATGAAGGTCTTGATGAATTGGTAACACCAATCTCTGAATATATTCAAGGACATCTCAATCAAACAGCTTTGGCAATAGCTAAAGATCTCACAGAAAAGTTACTCCAGGATGCCAATCAAAACAGGAAACAGACTCATTCGTCTGCTCAAGATAAAATCAATATTCTTGTCAATGATATTAAGTTATATGAAAAGGGTCTAAAATTGTTTCCTAGCGACCAACAGGTGCAGTTTATAAAATACCTCTTAAAATCTTTTGGTGGTGACATTCTATCAGAATTTTGCAAGTATGCAGCAAACCAGAGTAACCTTTCTGTACCAGTTGATAATTTATCAGTTGAACAGAGGAATAAAATTATGAATGATTTACCAGAAGAATATATGAAACCAATTCGTGCCCTTAATTCCACATTATCTGAGCAGAATATGGAACAGTTTTACCAAGCAGTTGATGTGTGCTTGGCTGAATGTGGTATGATTTTGAAAAAGGTTGATAAGAAGAAAGACAGACTTTTGGTTCAAAACCATAGAGAAAAATTGATTTCCGAAATAGAAAATTGTGATGAACCTGCTTTGGTACTGCATCTGGTAGTATTGGTTCTGTTTACTGTGCTGTCTCAGAACATGCTTCATGCGTCCGGAAGACAGGTCCCTTTGATAATTGCTTTCTTGAAATCACAGCTGAAGGATGAAGATTTTGATAAAGTTCAGAAATATCATGAATTGGTTGCGAAGTATTTAACTGCTGCTGATGATGAAAAAGAGGTGATTGAAGAGAAACTAAGAGAAGACTTACCTCTTCTGAAATCATTAGTTGCAGAAGTAGTAAACGGAGATCCACCCGATGGGGGTTTACGGGCATACACGGTGGTTTTAGGCTCTTTTTTAACAAATGGATTAATATTCGGTGTTATAAATTCATACAGTGTTATATATACTGTCCTACAAAAGAGGTTAGAAGATGAAAATGTACCAAATTCTGAAAGCAGAGCTGCTTTGGTTGGCGCTTTGACGATGGGAACTACCTTCCTATTGTCACCTATCTCAGGAGTGCTTACAGGACTAATGGGACTGAGGTGTACGGCTGTCTTGGGTGGAACTATTGCAGCATTTGGTTTGCTTATATCATCGTTCGCAATTGATTATGTCAATGTGCTATGTTTTACTTACGGCGTTATGTATGGTTTGGGAGCTTCTCTAGCCTACACTCCCTCACTGGCCATTCTTGGCCATTACTTCAAGAAACGTTTAGGTTTCGCTAACGGTATAGTTACCATCGGCAGTTCTGTTTTTACGGTAATAATACCACCTTTAATGGAACTGATGATAGAAAAATACGGTTTACCTGGACTATTCAGAGTATTGGCTCTTATAAGTGTGGGTATAGCCTTATGTGGCTTACTTTTTAAACCGATTCCAGTAGTTATTATAGATAAACCGGCTAGAAAAAATCATAAGGCACTTTTAAAAACAATAATTAACGTTCAAATTTGGAAGAACAATAAATATAGGTATTGGGCGATGTCTATGCCTATTGCTCTTTTCGGGTACTTTGTACCCTACGTACATATTAAGAAGTTTATAGAACTGAATTTCACTAATGTAAACGATAACTTACCCCTTCAGTGCATAGCTGTGACTTCGGGTATCGGGAGATTAATTTTTGGTATTTTGGCCGATAGAAAATGGGCAAATAGAATTATGTTACAACAAATATCGTTTTACGCAATTGGTACGTTAACAATAATATTGCCGTATGTAAAATCTTTTCCGTTGTTAGTCGTGATATCTTTGGGTATGGGGATATTTGATGGCGCATTTATAGCACTAATTGGGCCGATAGCTATACAGTTCTGTGGCAGCGCCCAGGCAGCGCAAAGCATTGGTTGTATGTTGGGTGTAGCGGCTTTCCCCCTATCATTAGGTCCACCCATAGCTGGGCTTATATTTAAAGCTCATCATTCCTATACGCTACCGTTCATATTAGCCGGCATTTCACCGTTAGTAGGTGCGACCATAATGTTCGCTATTAGGTTCCAAAAATAG

Protein sequence:

>DPOGS203162-PA
MAPSTDWDEIKRLAADFQKAQLSTTAQRLSERNCIEIVSKLIELKLIDVIFTIDGKEYLTPQQLIREIKDELYVRGGRVNTVDLAKELNVDLNQINLNVTEIIKGKEVQLVSGSLIAHYYLEKIAREINEKLQLQGQITVGDLTLQYDLPADLLQHGILEKYLGKIINGRQDPSDPRVFYTEEYITRTKAKIRGALMGLLKPTPINLILSHCNLTERLFMYLFDQLNAPGVLTGRQSGALYVPSCYTKSQNDWVINFFKQNNYLEYDALTRLGISDPKGYVKRVLSNENITFLSSCIIGSQIKQQLETALEECIASKSYLDVVSLLPSVLSDEDIENVLDALLKTNSSTILFDKTVFSNQYIENLKQACLPLAQKNAETVVKSGKYQQFYLEKQLVKNEAQQSHVDHKAERREERRKKASSGKGGGGTQGRETKTKAVKKHPRSKQVVQDSDSDEAPSVKKTPSQLEIVKVEDIENIIKEPLENEGLDELVTPISEYIQGHLNQTALAIAKDLTEKLLQDANQNRKQTHSSAQDKINILVNDIKLYEKGLKLFPSDQQVQFIKYLLKSFGGDILSEFCKYAANQSNLSVPVDNLSVEQRNKIMNDLPEEYMKPIRALNSTLSEQNMEQFYQAVDVCLAECGMILKKVDKKKDRLLVQNHREKLISEIENCDEPALVLHLVVLVLFTVLSQNMLHASGRQVPLIIAFLKSQLKDEDFDKVQKYHELVAKYLTAADDEKEVIEEKLREDLPLLKSLVAEVVNGDPPDGGLRAYTVVLGSFLTNGLIFGVINSYSVIYTVLQKRLEDENVPNSESRAALVGALTMGTTFLLSPISGVLTGLMGLRCTAVLGGTIAAFGLLISSFAIDYVNVLCFTYGVMYGLGASLAYTPSLAILGHYFKKRLGFANGIVTIGSSVFTVIIPPLMELMIEKYGLPGLFRVLALISVGIALCGLLFKPIPVVIIDKPARKNHKALLKTIINVQIWKNNKYRYWAMSMPIALFGYFVPYVHIKKFIELNFTNVNDNLPLQCIAVTSGIGRLIFGILADRKWANRIMLQQISFYAIGTLTIILPYVKSFPLLVVISLGMGIFDGAFIALIGPIAIQFCGSAQAAQSIGCMLGVAAFPLSLGPPIAGLIFKAHHSYTLPFILAGISPLVGATIMFAIRFQK-