Monarch geneset OGS2.0

DPOGS210769
TranscriptDPOGS210769-TA4818 bp
ProteinDPOGS210769-PA1090 aa
Genomic positionDPSCF300312 - 2683-30904
RNAseq coverage194x (Rank: top 48%)
Annotation
HeliconiusHMEL0054400.090.47% 
BombyxBGIBMGA011390-TA0.078.74% 
Drosophilahoe1-PC0.061.74% 
EBI UniRef50UniRef50_Q8IGX60.061.74%RE09889p n=21 Tax=Arthropoda RepID=Q8IGX6_DROME
NCBI RefSeqXP_002001964.10.061.94%GI14413 [Drosophila mojavensis]
NCBI nr blastpgi|1951148180.061.94%GI14413 [Drosophila mojavensis]
NCBI nr blastxgi|1571174320.061.95%tyrosine transporter [Aedes aegypti]
Group
Gene OntologyGO:00151055.5e-143arsenite transmembrane transporter activity
GO:00160215.5e-143integral to membrane
GO:00550851.1e-83transmembrane transport
GO:00157461.1e-83citrate transport
GO:00151371.1e-83citrate transmembrane transporter activity
KEGG pathway 
InterPro domain[599-1087] IPR0008025.5e-143Arsenical pump membrane protein, ArsB
[597-1028] IPR0046801.1e-83Divalent ion symporter
Orthology groupMCL10326 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210769-TA
ATGGATACATTTGTGCAAAGAGTGAAAGCGTGGAGCGGCGTCTCGAAGACGCCAGCGGCACCTGAGCTCAGCAGAAGTCAATACTCCTTGGTCTCAACGGGAGATTTGACACCAGAGGCTCTTCAAGTGTGGCTGAAGTTGCCGAAGAAGATAAAGTATGACCCTGAACTAGAGCCTTTCCAGAAGCGATACGAAAAGGAAATCGCCGGGAAGCAAGAAGAGATAGCACTTAGAATTAAAAATGGGTCAGTCCCGGAAACGAAACGCCTACCTCTAGTAAAATCAGCGCCACATCTTAATAATAATAAAGTAGAAGAAACTCCCAGCGCTGTTCTAGATGGAGTGGAAATTGAAAGTGCTGTCAAAGATAACAATGGGGTCGATGCAAACTCACATAAGCGGAAGAGTAAGTTTGCGTCAGCCCGTCACTACATCAAAATGTCCCTTTTATTGGGATGTTGGCTCTTCTTTACAATAGTTTTCATGATGTACAACGAAAAGGAAGACGTCCATAGGAATTCATACGTAAATCCAGGAGAAATTAAGACCGGGAAGCAAGAAGAGATAGCACTTAGAATTAAAAATGGGTCAGTCCCGGAAACGAAACGCCTACCTCTAGTAAAATCAGCGCCACATCTTAATAATAATAAAGTAGAAGAAACTCCCAGCGCTGTTCTAGATGGAGTGGAAATTGAAAGTGCTGTCAAAGATAACAATGGGGTCGATGCAAACTCACATAAGCGGAAGAGTAAGTTTGCGTCAGCCCGTCACTACATCAAAATGTCCCTTTTATTGGGATGTTGGCTCTTCTTTACAATAGTTTTCATGATGTACAACGAAAAGGAAGACGTCCATAGGAATTCATACGTAAATCCAGGAGAAATTAAGAGTTACGTATTAAATATTTCTGACCCGTCTCTGACACTTCGCGTAAAACTATCGGGACCTTTCCTTAGCGAGCAAGCGGAAGGCAAAATGAACTCCTCAGAAATGACAGGTTACCAGAAAATGGATGTGTGGCTGGAGAGATGGAGGGTAGAAAATGTAGGCAAGATAGTATCAGAAAGAGATGTCCTTGACAAAGAGGTTTCAAAAATCTGGACGATTCTCATTCATGAAGATGACTTGGATTTTACGACGGGGGAGTCTCGATCTAGCATTATTAATTTAAAATCAAATTCTACCAATGGGAACTCTGTGTTTGCTCTTAGAATGCGAACAACAGCCAATCAGAGTGCTCCGATCAGTTTGAACTATGTCATAAATCCGTTGGATCGTAATACAGGAGTTATATATTCCTGTATCCTGCTCTGTGGGTTGTATATATTGATTATATTTGAGGTCATCAATCGTACGATGGCAGCGGTTCTTATATCAACAACATCATTAGCAATTCTTTCAATAGTAGGAGAGAGACCTTCCCTCCCTGAAGTGATCTCATGGCTGGATGTGGAGACGCTCTTGCTGCTTTTCAGTATGATGATCCTCGTAGCCATAATGGCTGAGACTGGCATGTTTGATTTCTTAGCAGTATTTACTTTCGAGGTGACTAAAGGAAAACTGTGGCCATTGATAACATTACTATGTGTGATAACAGCGGTTCTGTCTACCTTCCTGGACAACGTCACCACCGTCCTTCTAATGACCCCAGTGACAATTCGGCTCTGTGAAGTTATGGACATGGATCCAGTGCCCATCCTCATGTCTATGGTGTTGTTCAGTAATATAGGTGGTACAGCAACACCGGTGGGAGATCCACCCAATGTTATCATAGCATCTAATAAAGCTGTCGTCCAATCGGTCATCAATCGTACGATGGCAGCGGTTCTTATATCAACAACATCATTAGCAATTCTTTCAATAGTAGGAGAGAGACCTTCCCTCCCTGAAGTGATCTCATGGCTGGATGTGGAGACGCTCTTGCTGCTTTTCAGTATGATGATCCTCGTAGCCATAATGGCTGAGACTGGCATGTTTGATTTCTTAGCAGTATTTACTTTCGAGGTGACTAAAGGAAAACTGTGGCCATTGATAACATTACTATGTGTGATAACAGCGGTTCTGTCTACCTTCCTGGACAACGTCACCACCGTCCTTCTAATGACCCCAGTGACAATTCGGCTCTGTGAAGTTATGGACATGGATCCAGTGCCCATCCTCATGTCTATGGTGTTGTTCAGTAATATAGGTGGTACAGCAACACCGGTGGGAGATCCACCCAATGTTATCATAGCATCTAATAAAGCTGTCGTCCAATCGGGTGTAAACTTTACGAATTTCACAATGCATATGACGATTGGCATCTTGCTTGTATGCGTACAGACATACTTCCAGCTACGTTATATATACAGGGACACCAACAAGCTAAGGTTGAATGTACCAAGGGATATACAAGATATACGTCATCAAATATCAATATGGCGTAGAGCGATAGAATCTCTCCCGCACTTGAGCAAAGATCAACATGTGGTCCGTGAAAGGCTGGAAAGAAAAGTGACGAAATTAAACTTACAATTGGACACCATGGTTAAGGAGAGCTACAAAAGAGTATGCCCAAAAGAAACTTTTCAGACTACGCTTAGCCAATTGAAAGACAAGTACGTAGTGAGGGACAAAATGCTGTTGATAAAATCGACTATAGCCATTACCTTTGTAGTCGTTGTGTTTTTTTTACACTCCATGCCGGAATTGAATCGAGTATCTTTAGGATGGACAGCTCTTTTAGGAGCAATACTATTGCTGACTTTGGCTGACAGGGAAGATCTCGAGCCGATATTACACAGAGTTGAATGGTCTACTCTGTTATTTTTTGCCGCTTTATTCGTGCTTATGGAGGCATTATCAAAGCTTGGTCTCATCGAGTTTATTGGTGGTATCACAGAATCTTTAATACTCAAAGTAGACGAGAATGGTAGATTAGCGGTGGCCATTTTACTCCTATTGTGGGTGTCAGGTGTAACATCAGCGTTTGTAGACAACATCCCTCTCACCACGATGATGGTGCGTGTCGTCATTGCTTTAGGATCGAACCCCAATCTGAATTTACCGATAACCCCATTGATATGGTCCCTGTCGTTTGGAGCTTGTTTGGGAGGTAACGGCACATTGATTGGTGCTAGCGCCAACGTTGTATGCGCAGGCGTCGCTGAACAGCACGGCTATCGGTTCACCTTCATGCAGTTCTTCAAAATTGGTTTCCCGGTCATGATAGGACATCTCATAGTCGCATCCGGATATCTACTCGTCTGTCACTGCGTGTTTACCTGGCATTAGGGCTTTTTATTCTATTTGATATTATCTAATGTGTATTTTGCACATATGAAGGCTATAAATCGCAATAATCAATCAAATTGTATATATGAAACCACCAATTTATTAGTCGATGATAAAATATGATAGTCCCTAATATTGTTTATAGATTGAGTATCAAAAAAATATATAAATTTATATATACATATATAAATTTAAATCTCAAACTAAGAGGTTATATATGAAAGGTCAATTTCGTAGAACCGGATATGGAAGAGCAGTTGTTCTTATTATTAAATTAACCCAAGTACCTTAAAATATTATATGTGATTTAAAAATTATGTCGTACTTAAATATCAATTTCGTTTAAATTATTTTTAATAGTATTTAATTAAGTGACCGCAAAGACAATGATGAAGACAGGGCATCGAAGCGGTCGAAACTTCGCCCTTTATTTATTTCCTTTTATAATTATTCTAATGTATTAATTTTACAAAATTTTGTAGATATCATTTTTGTTTTTGTTAGAAATTAGAACTAAATAATTGATATAAAACAAACAATTATATTTTTTTTGTAAAAACGTTAGGTTTGTGTTTGTATTTAAAATTTAGTTATAACTAGAATGTTTTAGAATCAGTATTAGACAATGTTATGATTAAAAAATAAGATTTTTTAGATATTTACACTAAGGTTTCTTAGTGTCAAAATGAGGAAACATTGTTGGTAATTACGCCAAGACATAGATAATTCATAGGAATTTGAGTATAAATATAATTTGCTATCAAATTTACGAAACAAAATAACTTTAGCGAAAATTGTAAGAGAAAAAAATCCAGTATTGTTTTTCTAAATTTTGTACAAGGAACGTTTTATTTAAATCATTATTTTTATATCAATATACATCTTCATTTAACTTTAAGTATACTAAAACAATTAAATATATAAATTTACAAAGAAAGCTTTTTAATCCGTTTCATTATGTTTATTCCTCTAGAAGTTATGTAGGACTTAAAACTATTTCAAAGGGCATACAATTTTTGCTCAAAGGAACTAATATTTCCAAAACTACATATAATACTCGTATTACTGGTTAGGTTTTTGCAGATATAACCAAGTATCTTATCTAAAAAAAGTGTATTAAGCAACGACCACATTAGGTATATTCCTCAATTGCCTGTTTGGCAAAGGAATTAAAGCTGGATAGAACATTCACAGATAAATATAAAGAGCGATTCTCTCGTGTGTATCCACGTCATTAATTGCAGAAGAAGGCAACGATCTCGCATTGGCAGATATCAGAATTAACACAATGTAACGAAAGTCATGGATGTCACGATTTTTTTTTATACCACGACGTTTGCGAACTCGTTTACAGCCTGCATGATGAAACTTTTTCAATAGTCCCTTGAGAAAATCTCTGCAGGGAGTTTATTTCATAGCGCCAGCGTTAGTGGAGGAAATTGATCTGTTGCCGCTCAGTGATGCACGACCAATAGCTAGCCAAGGAGTGAGGATGGACACTCTGAAGATGGCGAGCGACATGGGGGCAAATGAACTGCAACATAG

Protein sequence:

>DPOGS210769-PA
MDTFVQRVKAWSGVSKTPAAPELSRSQYSLVSTGDLTPEALQVWLKLPKKIKYDPELEPFQKRYEKEIAGKQEEIALRIKNGSVPETKRLPLVKSAPHLNNNKVEETPSAVLDGVEIESAVKDNNGVDANSHKRKSKFASARHYIKMSLLLGCWLFFTIVFMMYNEKEDVHRNSYVNPGEIKTGKQEEIALRIKNGSVPETKRLPLVKSAPHLNNNKVEETPSAVLDGVEIESAVKDNNGVDANSHKRKSKFASARHYIKMSLLLGCWLFFTIVFMMYNEKEDVHRNSYVNPGEIKSYVLNISDPSLTLRVKLSGPFLSEQAEGKMNSSEMTGYQKMDVWLERWRVENVGKIVSERDVLDKEVSKIWTILIHEDDLDFTTGESRSSIINLKSNSTNGNSVFALRMRTTANQSAPISLNYVINPLDRNTGVIYSCILLCGLYILIIFEVINRTMAAVLISTTSLAILSIVGERPSLPEVISWLDVETLLLLFSMMILVAIMAETGMFDFLAVFTFEVTKGKLWPLITLLCVITAVLSTFLDNVTTVLLMTPVTIRLCEVMDMDPVPILMSMVLFSNIGGTATPVGDPPNVIIASNKAVVQSVINRTMAAVLISTTSLAILSIVGERPSLPEVISWLDVETLLLLFSMMILVAIMAETGMFDFLAVFTFEVTKGKLWPLITLLCVITAVLSTFLDNVTTVLLMTPVTIRLCEVMDMDPVPILMSMVLFSNIGGTATPVGDPPNVIIASNKAVVQSGVNFTNFTMHMTIGILLVCVQTYFQLRYIYRDTNKLRLNVPRDIQDIRHQISIWRRAIESLPHLSKDQHVVRERLERKVTKLNLQLDTMVKESYKRVCPKETFQTTLSQLKDKYVVRDKMLLIKSTIAITFVVVVFFLHSMPELNRVSLGWTALLGAILLLTLADREDLEPILHRVEWSTLLFFAALFVLMEALSKLGLIEFIGGITESLILKVDENGRLAVAILLLLWVSGVTSAFVDNIPLTTMMVRVVIALGSNPNLNLPITPLIWSLSFGACLGGNGTLIGASANVVCAGVAEQHGYRFTFMQFFKIGFPVMIGHLIVASGYLLVCHCVFTWH-