Monarch geneset OGS2.0

DPOGS212185
TranscriptDPOGS212185-TA2154 bp
ProteinDPOGS212185-PA717 aa
Genomic positionDPSCF300344 - 54518-69747
RNAseq coverage572x (Rank: top 22%)
Annotation
HeliconiusHMEL0131520.080.45% 
BombyxBGIBMGA010726-TA2e-15242.40% 
DrosophilaCG11147-PB5e-17143.68% 
EBI UniRef50UniRef50_Q7PYQ47e-17945.81%AGAP002060-PA n=1 Tax=Anopheles gambiae RepID=Q7PYQ4_ANOGA
NCBI RefSeqXP_320987.40.046.52%AGAP002060-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3454914353e-17845.25%PREDICTED: ABC transporter G family member 20-like isoform 1 [Nasonia vitripennis]
NCBI nr blastxgi|1571051911e-17845.84%abc transporter [Aedes aegypti]
Group
Gene OntologyGO:00055241.3e-16ATP binding
GO:00168871.3e-16ATPase activity
GO:00001661.2e-10nucleotide binding
GO:00171111.2e-10nucleoside-triphosphatase activity
KEGG pathway 
InterPro domain[51-170] IPR0034391.3e-16ABC transporter-like
[36-225] IPR0035931.2e-10ATPase, AAA+ type, core
Orthology groupMCL17455 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212185-TA
ATGGCGGCACCAGCCGTAGTTGTGGAGCGGGCTTACAAGTATTATGGCAAACAGGACAAGCCCAGCTTCAAACCAGTCCTGCAGTACTTGGACATGACCGTGGAAAAAGGAATTATATATGGCCTACTTGGACCCTCTGGATGTGGGAAAACAACACTTCTCTCATGCATAGTCGGCAGACGAAAGCTCGACAGTGGTGAAGTTTGGGTGCTGGGAGGAAGACCAGGGGAGAAAGGCAGCGGAGTTCCCGGGCCTCGAGTGGGATATATGCCTCAGGATATCGCTCTTGTTGCGGAATTTTCAGTACGAGATGCTGTATACTACTTCGGTAGGATCTATGGGATGAAGCCAAAGAAACTCCAAGAACAGTTTGAATTTCTGACTAGATTGCTAGAGCTCCCAAAAGGCGGTAGACTCATCAAATCACTTTCCGGAGGGCAGCAACGGCGGGTCTCGCTGGCCGCTGCACTGGTCCACGAGCCTGAACTTCTCATTCTGGATGAGCCCACAGTTGGACTGGATCCGGTCTTAAGAGAACGAATCTGGGAATTTCTGGCTGAGCTGGCGAGAGGTGGTACAACAGTCATCATCACCACTCACTACATCGATGAAACGAAACAAGCGCACAAGATTGGTCTCCTCCGAGACGGTCAGCTGTTAGACGAAGATTCTCCCGAGGAGATCCTCCGAAGATACAATTGCAATAACCTTGAAGATGCCTTCCTTAAGATGGCGATGCGACAAACTGAAATGAAACACCGAAGACGACCTACTCTCACAGCTTCTCCAGATGTTATACCTGAGAGCAGCATAGTTAATGACAGTCGATACCAATCAAGAGAAGAGTTTAATGTAGTAACTAGTAGCACCGATGCTCTAACCAAAAAACAGAAGCCACAGCACATTCCGAGCAATTCAAAAGGCAGATACAAAGCAGTGTTTATTAAAAGTATACAACAGTTTTCCAGGCATCCTGGGGGACTAATATTCTCGGTGCTATTCCCCATCATCCAAGTAGTCGCATTCTTCCTCGCCGTGGGTCATGACCCTCGTGATCTACACGTTGCTGTCGTCAATTATGAAGCTGCTACGTCACCATTGGGTATAGATGTATGTAAGAATGGAAGTCTAACAACTGTGATACAAAGGGAGGACGAGACATGCGAACAATATATGCTGAGCTGCTGGTTTCTGGAGGAAATGGAAAAACGAAAATTGTATCCTACGACATATAACTCGACCGAAGAAGCAAAACAAGCTGTAGCAACTCGCAAGCTATATGGAGCGATCCGTTTCCCTTCCAACTTCAGCCTAGCGCTAGGGATACGAGCTGCTGAAGGTTTCGTGGGAGACAGCATCCTCAACGACAGTACTATCAGCGTTTGGCTCGATATGACCGACCATCAAATATCACACTTTATAAAGATGCAACTTCACAAAGCTTACGAACATTTCGCTCGTCGTACCATGGCTGCTTGCGGAAAGAATGAAGACCTCGTGCAGATTCCGGTGAGATTTGAAGAACCCATTTACGGATCGATGAATACTGAGGTCGTTGCTTACATGGCTCCGGGGGTAATGGTCACTATAATCTTCTTTCTGGCTGCGATAATTACGTCGACTCTTATGATCTCCGACCGTCTTGAGGGTGTGTGGGAGCGAAGCGCGGTTGCCGGAGTCCGACCCAAGGAGATGTTGCATGTTCATATTACTTTACAGAGCATGGTCATTCTTGTACAGACATTCGAGATGATGATTGTAGCGTTTATGGGCTACAAGTTGCCATTTAACGGCTCCTTATGGACGTGTGGCGCGCTGCTTTTCCTCCAGGGCCTCGGAGGAATGTGCTATGGCTTCCTTTTGTCTGTATTATGTTCCAGCTTTACTGTCTCTTTCTTTATAGCAACCGGCAGCTTCTATCCAATGATATTGCTCTGTGGTATTCTCTGGCCTCTGGAGGGTATGCCAGAAGCACTACGACTCTTCTCTTTGGCGCTGCCCTTCACTCTACCTTCAATATCGTTACGAGATATGATGGAGAAAGGTTCATCCATCACCAGCCCAAGCGTTTACACCGGATTTTTGATAACTTTAGGGTGGATTTTCGGTACTCTAGCGTTATGTTTCCTGAGACTAAGGTTTCGAAAGACTTAA

Protein sequence:

>DPOGS212185-PA
MAAPAVVVERAYKYYGKQDKPSFKPVLQYLDMTVEKGIIYGLLGPSGCGKTTLLSCIVGRRKLDSGEVWVLGGRPGEKGSGVPGPRVGYMPQDIALVAEFSVRDAVYYFGRIYGMKPKKLQEQFEFLTRLLELPKGGRLIKSLSGGQQRRVSLAAALVHEPELLILDEPTVGLDPVLRERIWEFLAELARGGTTVIITTHYIDETKQAHKIGLLRDGQLLDEDSPEEILRRYNCNNLEDAFLKMAMRQTEMKHRRRPTLTASPDVIPESSIVNDSRYQSREEFNVVTSSTDALTKKQKPQHIPSNSKGRYKAVFIKSIQQFSRHPGGLIFSVLFPIIQVVAFFLAVGHDPRDLHVAVVNYEAATSPLGIDVCKNGSLTTVIQREDETCEQYMLSCWFLEEMEKRKLYPTTYNSTEEAKQAVATRKLYGAIRFPSNFSLALGIRAAEGFVGDSILNDSTISVWLDMTDHQISHFIKMQLHKAYEHFARRTMAACGKNEDLVQIPVRFEEPIYGSMNTEVVAYMAPGVMVTIIFFLAAIITSTLMISDRLEGVWERSAVAGVRPKEMLHVHITLQSMVILVQTFEMMIVAFMGYKLPFNGSLWTCGALLFLQGLGGMCYGFLLSVLCSSFTVSFFIATGSFYPMILLCGILWPLEGMPEALRLFSLALPFTLPSISLRDMMEKGSSITSPSVYTGFLITLGWIFGTLALCFLRLRFRKT-