Monarch geneset OGS2.0

DPOGS202939
TranscriptDPOGS202939-TA1458 bp
ProteinDPOGS202939-PA485 aa
Genomic positionDPSCF300220 + 226823-232409
RNAseq coverage79x (Rank: top 64%)
Annotation
HeliconiusHMEL0142161e-15772.42% 
BombyxBGIBMGA001907-TA5e-9359.52% 
DrosophilaCG15890-PA9e-6537.09% 
EBI UniRef50UniRef50_Q6NPA25e-6337.09%RE46682p n=23 Tax=Endopterygota RepID=Q6NPA2_DROME
NCBI RefSeqXP_001659917.12e-6940.44%adenylate cyclase [Aedes aegypti]
NCBI nr blastpgi|3479721512e-6840.76%AGAP004562-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|2700088603e-7139.62%hypothetical protein TcasGA2_TC015466 [Tribolium castaneum]
Group
Gene OntologyGO:00550852.9e-20transmembrane transport
GO:00160212.9e-20integral to membrane
KEGG pathway 
InterPro domain[52-363] IPR0117012.9e-20Major facilitator superfamily
[1-364] IPR0161962.3e-16Major facilitator superfamily domain, general substrate transporter
Orthology groupMCL20686 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202939-TA
ATGGAAAAAGCCTGCCGCGCTGATCTAAATTTTTCAGAATTTATTTGCTCACAAGTCATAGCTGGTAATTTTAGTGATAACATGACACTAGTTGCCTTAGATAAATCACAAAAACTAGTCGCCGAAATGACAGCATGGAAACAACCCATACAAAGCGGTATACCAGCAATCGCAATATTGTTTATTGGTGCTTGGAGTGATAAAACTGGAAATCGAAAGCTATTGATGCTCATACCTATATTGGGAGAACTGATATCGGCTATAGGAATGATTTTAACCACATACTATTTTCTAGAGTGGCCTTTATGGGTGACGGGATTAATAGAAGCTTTACCGTCTGCACTTACTGGTGGCTTATCAATAGTTCTAATGGGTTCATATAGTTTCATAGCTGACGTTACGACTGTGGAAAATCGTACGTTTAGAATAGGATGCGTCGCCGTTATTGTGACACTAGGAATTCCTTTGGGAACATCTATCAGTGGTGTTTTGACACAGCAAGTGGGATATTATGGAATATTTGGTATAGGTGTAGTATTTTTTACATTTGGATTTCTACAAACGTGGTTTAGAGTACACGATGTTAGAAACGAACCATTAAAAGGTACTTTCGTGGAAAAGCTTTTAAGTTTCTTTAATCCTTTAAATGCTTGGGATACTTTATCCCTACTATTTATACCTCGCACGAAAAAGTTGATACCAATATGGTTGGTTGTTTGGGCTCATATAATTGTTATGGGACCTGTTTTTGGTGAAAATCCCGTATTATATTTGTATACTTTGAAAAAGTTTAAAATGGATGTCGTGGACTTTAGTCTATTTTCTACTTACTCAGTGCTCATGGGACTAGCTGGCACTTCAGTGGCAGTGGGAGTCCTTAGTAAGATATTAAAGATTCACGATGCAGCTCTAGGTGTCTTAGCAACTTCATCAAAAGTTCTTTCAAGCATGCTCTATGGCTTAGCACCGACCAGAACTTGGTTTTTCGTGGGCCCTGTTTTAGACTTCTTCGGGAACTCTGGGTCCACTGTGGTGAGATCGATGGGTACAAAAGTCGTTGAAGCTGAGAAAGTCGGCACTTCAGTGGCAGTGGGAGTCCTTAGTAAGATATTAAAGATTCACGATGCAGCTCTAGGTGTCTTAGCAACTTCATCAAAAGTCCTTTCAAGCATGCTCTATGGCTTAGCACCGACTAGAACTTGGTTTTTCGTGGGCCCTGTTTTAGATTTCTTCGGGAACTCTGGATCCACTGTGGTGAGATCGATGGGTACAAAAGTCGTTGAAGCTGAGAAAGTCGGAAAGATGTGCTCATTAATTGGTTTTGTGGAATCCGTAGTGCCTGTTATTTACACACCTTTATACAGCAAAGTGTACTCGCTGACATTAGAAACATTTTCTGGTGCATTTTATGTGATGGGCAGCTTGATGACATTACCAGCGATTTTCATATTTTTGTAA

Protein sequence:

>DPOGS202939-PA
MEKACRADLNFSEFICSQVIAGNFSDNMTLVALDKSQKLVAEMTAWKQPIQSGIPAIAILFIGAWSDKTGNRKLLMLIPILGELISAIGMILTTYYFLEWPLWVTGLIEALPSALTGGLSIVLMGSYSFIADVTTVENRTFRIGCVAVIVTLGIPLGTSISGVLTQQVGYYGIFGIGVVFFTFGFLQTWFRVHDVRNEPLKGTFVEKLLSFFNPLNAWDTLSLLFIPRTKKLIPIWLVVWAHIIVMGPVFGENPVLYLYTLKKFKMDVVDFSLFSTYSVLMGLAGTSVAVGVLSKILKIHDAALGVLATSSKVLSSMLYGLAPTRTWFFVGPVLDFFGNSGSTVVRSMGTKVVEAEKVGTSVAVGVLSKILKIHDAALGVLATSSKVLSSMLYGLAPTRTWFFVGPVLDFFGNSGSTVVRSMGTKVVEAEKVGKMCSLIGFVESVVPVIYTPLYSKVYSLTLETFSGAFYVMGSLMTLPAIFIFL-