Monarch geneset OGS2.0

DPOGS200173
Transcript: DPOGS200173-TA (2142 bp)
Protein: DPOGS200173-PA (713 aa)
Genomic position: DPSCF300128 + 413275-427176
RNAseq coverage: 746x (Rank: top 17%)
Annotation
Heliconius       HMEL002238         0.0   85.18%
Bombyx           BGIBMGA002922-TA   0.0   84.33%
Drosophila       w-PA               0.0   54.76%
EBI UniRef50     UniRef50_Q8WRF2    0.0   56.50%   ABC transmembrane transporter n=27 Tax=Pancrustacea RepID=Q8WRF2_TRICA
NCBI RefSeq      NP_001037034.1     0.0   83.89%   ATP dependent transmembrane transporter protein [Bombyx mori]
NCBI nr blastp   gi|218775025       0.0   84.31%   ABC transporter [Bombyx mori]
NCBI nr blastx   gi|218775025       0.0   84.31%   ABC transporter [Bombyx mori]
Group
Gene Ontology    GO:0016020   3.3e-27   membrane
                 GO:0005524   4.8e-10   ATP binding
                 GO:0016887   4.8e-10   ATPase activity
                 GO:0000166   1.7e-08   nucleotide binding
                 GO:0017111   1.7e-08   nucleoside-triphosphatase activity
KEGG pathway     cme:CMS467C   2e-85
                 K05681 (ABCG2)   maps-> ABC transporters
InterPro domain  [413-565]   IPR013525   3.3e-27   ABC-2 type transporter
                 [126-256]   IPR003439   4.8e-10   ABC transporter-like
                 [111-305]   IPR003593   1.7e-08   ATPase, AAA+ type, core
Orthology group  MCL16636   Insect specific

Nucleotide sequence:

>DPOGS200173-TA
ATGACCGCCGGATCGGAAGAGCACGAGCCCCTGATATCTTCTTCGGTTGATAATCAACGTGTTACATATAATAATTCTCCTCAGGACACATCACCAAACGACAGTCCACGTGGAAGCGCTGGTGAAGTAACTTTAGCTATACCACAACCAAGAAGCTATGGCGCGATCGGTGGCGCTGAGAAGGTCACTTACACGTGGGCGGACATCAACGCCTTCGCCACCGAAGACAGATCGAGGAATAGAAAGTTCTGGAATTTCTGGAGGGGATCCAACAACAGAATGTTCCAGCAGAGAAAACAGCTCCTCAGAAATGTTAACGGCGCCGCTTACCCTGGAGAGTTGTTGGCGCTGATGGGCTCGTCAGGAGCTGGGAAGACCACGTTATTAAACACCCTCACATTCCGTACCCCGAGTGGTGTAATGTCCAGCGGTACCAGAGCTTTGAATGGACAGCCAGCCACTCCTGAGGCTATGGCTGCTCTGTCAGCGTATGTACAACAGCAAGACCTCTTCATAGGAACTCTCACTGTCAAGGAGCATCTGATATTCCAGGCTCTGGTGAGAATGGATAGACACATACCTTACGCACAGCGCATGCGCAGGGTCCAAGAAGTCATATCGGAGTTGGCATTGACAAAATGCCAGAACACGGTGATCGGTATCCCTGGCAGATTGAAGGGTATATCTGGCGGAGAGATGAAGAGGTTGTCGTTTGCCAGCGAGGTACTCACAGACCCTCCGCTCATGTTTTGCGATGAACCCACATCGGGGTTGGATTCTTTCATGGCACAGAATGTTATTCAGGTACTGAGTGGTTTAGCCAAGAAAGGCAAGACCATTGTATGTACGATACATCAGCCGTCTTCCGAATTATACGCTATGTTTGACAAACTTCTAATAATGGCGGAGGGTAGAGTCGCTTTTCTCGGCTCTCCTGATCAAGCGACAGAATTCTTTAGAGATCTAGGAGCCGCATGTCCCGCTAACTACAATCCAGCGGATCACTACATCCAACTGTTGGCGGGAGTGCCAGGCCGAGAGGAGACCACACGTCACACTATAGACACCGTGTGCACAGCCTTCGCACGTTCAGAAATTGGATGCAAGCTGGCTGCTGAGGCGGAAAACGCCTTATATCACGAGCGTAAGATAGCGTCTGGCTGGGTGGAGTCCCCGTGGTCGTCAAGCCTGACGGGGCGGTACCCTCGCTCCCCGTACAAGGCCTCGTGGTGCGCTCAGTTCCGGGCGGTGCTATGGCGCTCGTGGCTCTCCGTCACCAAAGAGCCCATGCTCATCAAAGTGCGCTTCCTACAGACTATCATGGTGGCGTTGTTGATCGGCGTGATCTACTTCGGCCAGGAGTTGGATCAGGACGGCGTGATGAACATCAACGGAGCCATCTTCATGTTCCTCACCAACATGACCTTCCAGAACATCTTCGCTGTCATCAATGTGTTCTGCTCGGAGCTGCCGATCTTCATCCGCGAGCATCACTCGGGCATGTACCGCGCGGACGTGTACTTCCTCAGCAAGACGCTGGCGGAGGCGCCGGTGTTCGCCACCATCCCTCTCGTGTTCACCACCATCGCCTACTACATGATCGGCCTCAACCCGTCGCCCGAGAGGTTCTTCATCGCATCGGGCCTGGCGGCGCTGATCACCAACGTGGCTACTTCCTTCGAGTACGCGCTGGGCTACCGACAAAATGCCACTAGGACGTCTCTAGCGACAGGAACCGGTCCGAGGGGTTCCGGCTTTGAAGGATACCTCATCTCCTGTGCTAGTAGCAGCGTGAGTATGGCCGCGTCTGTGGGACCGCCCATCATCATACCGTTCATGCTCTTCGGAGGATTCTTCCTCAACTCAGGCTCCGTGCCGCCGTACCTCGGCTGGATCTCGTACTTGTCGTGGTTCCGTTACGGCAACGAGGCGTTGCTGGTGAACCAGTGGTCGGGGATCGAGAGCATCGCGTGCACGCGAGAGAACTTCACGTGCCCGGC
GAGCGGGGACGTTGTACTCACGACCTTGAGTTTTTCGGAGAAAGACTTCACAATGGATGTAGTGAACATGGTCCTCTTGTTCGTGGGTTTCCGTCTGCTGGCCTACTTCGCTCTGCTGTGGAGGACGAGGCGCGGGAAATAA

Protein sequence:

>DPOGS200173-PA
MTAGSEEHEPLISSSVDNQRVTYNNSPQDTSPNDSPRGSAGEVTLAIPQPRSYGAIGGAEKVTYTWADINAFATEDRSRNRKFWNFWRGSNNRMFQQRKQLLRNVNGAAYPGELLALMGSSGAGKTTLLNTLTFRTPSGVMSSGTRALNGQPATPEAMAALSAYVQQQDLFIGTLTVKEHLIFQALVRMDRHIPYAQRMRRVQEVISELALTKCQNTVIGIPGRLKGISGGEMKRLSFASEVLTDPPLMFCDEPTSGLDSFMAQNVIQVLSGLAKKGKTIVCTIHQPSSELYAMFDKLLIMAEGRVAFLGSPDQATEFFRDLGAACPANYNPADHYIQLLAGVPGREETTRHTIDTVCTAFARSEIGCKLAAEAENALYHERKIASGWVESPWSSSLTGRYPRSPYKASWCAQFRAVLWRSWLSVTKEPMLIKVRFLQTIMVALLIGVIYFGQELDQDGVMNINGAIFMFLTNMTFQNIFAVINVFCSELPIFIREHHSGMYRADVYFLSKTLAEAPVFATIPLVFTTIAYYMIGLNPSPERFFIASGLAALITNVATSFEYALGYRQNATRTSLATGTGPRGSGFEGYLISCASSSVSMAASVGPPIIIPFMLFGGFFLNSGSVPPYLGWISYLSWFRYGNEALLVNQWSGIESIACTRENFTCPASGDVVLTTLSFSEKDFTMDVVNMVLLFVGFRLLAYFALLWRTRRGK-
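
Consistency check (sketch):

The transcript and protein lengths above fit a single open reading frame: 713 codons plus one stop codon is 2142 bp. The Python sketch below shows one way to check that relationship and to translate a coding sequence for comparison with the protein record. It assumes the standard genetic code and assumes that the trailing "-" in the protein sequence marks the stop codon. The names CODON_TABLE and translate are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1.

    Stop codons are written as stop_symbol (assumed here to be "-",
    matching the last character of the protein record).
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Lengths as stated in the record: 713 aa plus one stop codon = 2142 bp.
TRANSCRIPT_BP = 2142
PROTEIN_AA = 713
assert TRANSCRIPT_BP == 3 * (PROTEIN_AA + 1)

# First 21 bases and last 12 bases of DPOGS200173-TA, copied from the record.
print(translate("ATGACCGCCGGATCGGAAGAG"))
print(translate("CGCGGGAAATAA"))
```

To check the whole record, pass the full nucleotide sequence (both lines joined) to translate and compare the result with the DPOGS200173-PA sequence.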