Monarch geneset OGS2.0

DPOGS209284
TranscriptDPOGS209284-TA1800 bp
ProteinDPOGS209284-PA599 aa
Genomic positionDPSCF300589 - 2033-16548
RNAseq coverage121x (Rank: top 57%)
Annotation
HeliconiusHMEL0029202e-11971.48% 
BombyxBGIBMGA005094-TA1e-10735.59% 
DrosophilaCG9664-PA6e-10138.31% 
EBI UniRef50UniRef50_E3WTP89e-12240.07%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WTP8_ANODA
NCBI RefSeqXP_001653357.13e-12038.76%abc transporter [Aedes aegypti]
NCBI nr blastpgi|3123811293e-12140.07%hypothetical protein AND_06636 [Anopheles darlingi]
NCBI nr blastxgi|3123811293e-12340.03%hypothetical protein AND_06636 [Anopheles darlingi]
Group
Gene OntologyGO:00055241.3e-189ATP binding
GO:00160211.3e-189integral to membrane
GO:00160204e-34membrane
GO:00168874.8e-17ATPase activity
GO:00001661.6e-08nucleotide binding
GO:00171111.6e-08nucleoside-triphosphatase activity
KEGG pathwaybta:5084435e-100 
 K05680 (ABCG4)maps-> ABC transporters
InterPro domain[16-594] IPR0200641.3e-189ABC transporter, G1-like
[326-535] IPR0135254e-34ABC-2 type transporter
[68-180] IPR0034394.8e-17ABC transporter-like
[53-229] IPR0035931.6e-08ATPase, AAA+ type, core
Orthology groupMCL18985 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209284-TA
ATGGATGTTGAAAGTGATTTGAAAAATGGCCGTTACTTGTGTCCGCTGACGTACACGGATCTTGGTTTCAGTGTTAAGGGCAGGTGGCTCTTTTCAAGGAAGAGAAAAGATAATGATTTACAAATTATTCTTAAGGATGCTTGTGGGGCGATTCGGCCAGGTCGCCTTACATTCATACTTGGACCCTCAGGGGCTGGAAAGTCTACACTACTGAAAATATTAGCCGGCAGAAAGAAATCTGGCGTGACCGGATCTCTGCAAGGAGTTTCCCGGAATGTAGTCTTGGTGCCTCAACACGTGACACTTATAGACACTCTCACTGTGAAAGAGACAATGCAATTCGCTGCTTCATTGAAACTATCTCGTGCTAGTTATCAAGAACAATATAATGCTATAGAAAGAATATTAAAACAATTAGGCATCCATGATGTCTTGAATACGAGAGCCGGTCGACTGTCGGGGGGAGAACGCAAAAGGCTCACCGTCGCCTGTGAGCTTCTCACGGATCCATCAATTATGTTACTCGATGAACCGACCAGCGGACTAGACTCCGTATCATCAATGTCAGTAGCGAGGGCTCTAAAAACAGTAGCACAAAGTGGAAGAACAGTGGCATGTGTGATACACCAGCCTTCTTCTCAACTGTTCACATCTGCTGATGACGTAATTCTAATGGCTAATGGCAGGACATTATACGCTGGCGCCGTCACGGATGTACCAGAACTGATAAGGAAATCTGGTTTCGTGTGCCCACTCTATTACAACTTGGCCGATTATCTTTTAGAAATTGCAAGCGGGGAACATCCTGGTAATCTAACAAATCTAGAAACTAGTACAAAAACTTATGCACTTGAAATTAAAAGAAATGCAGAAAAAGATACTCAGGAAAGAAGTGAAAAGGATAACTCAACAGAAGCTGAAGCCTTATTACACAGAAAAGTTCAGCCCGACAATCATTACGAGGCGGGATTTGTGCAACAACTACGGTCGCTTCTTTGGAGGGGCTACTTGGGAGCTCTTAGGGACATTCATTTAACGCAGATACGGATTTTAACCCATTTCGTGGTGGCGTTATTGTTGGGTGCTTTGTATCAAGGCGCCGGTGCGGAGGCGAGCCGTATGATTTCAAATACAGGATGTCTATTCTTTTTCTTACTATTTTTATATTTCTCAAACGCAATGCCCACAATACATACTTTCCCAGTGGAATCAACCGTGGTACTCCAAGAACATCTCAACAAATGGTATTCCCTATCAACGTATTGCATCACAAAAGTTATAGTGGATCTGCCTATTCAGCTACTTTGCGCAACGATATTTGTTTTACCGGCGTGGTACTTAACTTCTCAGCCACTTGAACCATATAGACTCATTTTGGCATGGAGTATTTGCACCCTCCTGACAATCCTAGCTCAAACTTTTGGCCTAGTAGTTGGAGCTGCGTGTGGAGTTAAGCTCGGTCTTTTCGTGATTCCTGCTGCCAACATCCCCATGTTAATGTTTTCTGAATTCTTTATCCCCTACCATGAAATGCCATCTTATTTACAGCCGTTCGCTTTAATGTCCTACTTCCGTTACTCTTTTGATGCATTCATGCAAACCGCTTACGGCTTCGGTCGCAAGAACTTACCTTGCAACAAAATATTTTGTCTATTCAAGCAACCTGACAAATATCTTGACTACTTGGGCCTTCAGCAAAACTTCTCAAATGATGTTATAGCACTTGTTATTTGGATTGTTTTATTAAAAATATCATTGATTTGTGTTTTAAAATACAGAGTCTTCAAAGCTTGTAGATAA

Protein sequence:

>DPOGS209284-PA
MDVESDLKNGRYLCPLTYTDLGFSVKGRWLFSRKRKDNDLQIILKDACGAIRPGRLTFILGPSGAGKSTLLKILAGRKKSGVTGSLQGVSRNVVLVPQHVTLIDTLTVKETMQFAASLKLSRASYQEQYNAIERILKQLGIHDVLNTRAGRLSGGERKRLTVACELLTDPSIMLLDEPTSGLDSVSSMSVARALKTVAQSGRTVACVIHQPSSQLFTSADDVILMANGRTLYAGAVTDVPELIRKSGFVCPLYYNLADYLLEIASGEHPGNLTNLETSTKTYALEIKRNAEKDTQERSEKDNSTEAEALLHRKVQPDNHYEAGFVQQLRSLLWRGYLGALRDIHLTQIRILTHFVVALLLGALYQGAGAEASRMISNTGCLFFFLLFLYFSNAMPTIHTFPVESTVVLQEHLNKWYSLSTYCITKVIVDLPIQLLCATIFVLPAWYLTSQPLEPYRLILAWSICTLLTILAQTFGLVVGAACGVKLGLFVIPAANIPMLMFSEFFIPYHEMPSYLQPFALMSYFRYSFDAFMQTAYGFGRKNLPCNKIFCLFKQPDKYLDYLGLQQNFSNDVIALVIWIVLLKISLICVLKYRVFKACR-