Monarch geneset OGS2.0

DPOGS203229
Transcript: DPOGS203229-TA; 1251 bp
Protein: DPOGS203229-PA; 416 aa
Genomic position: DPSCF300035 + 1297424-1306587
RNAseq coverage: 402x (Rank: top 30%)
Annotation
Heliconius: HMEL003247; 2e-169; 83.20%
Bombyx: BGIBMGA009193-TA; 1e-108; 55.88%
Drosophila: CG7458-PA; 3e-62; 30.99%
EBI UniRef50: UniRef50_E2A7K4; 8e-62; 32.87%; Solute carrier family 22 member 21 n=7 Tax=Formicidae RepID=E2A7K4_CAMFO
NCBI RefSeq: XP_320175.3; 2e-63; 32.23%; AGAP012383-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|383863003; 5e-66; 35.14%; PREDICTED: solute carrier family 22 member 21-like [Megachile rotundata]
NCBI nr blastx: gi|383863003; 9e-66; 35.14%; PREDICTED: solute carrier family 22 member 21-like [Megachile rotundata]
Group
Gene Ontology: GO:0055085; 9.2e-17; transmembrane transport
Gene Ontology: GO:0016021; 9.2e-17; integral to membrane
KEGG pathway 
InterPro domain: [1-405] IPR016196; 1.4e-32; Major facilitator superfamily domain, general substrate transporter
InterPro domain: [128-408] IPR011701; 9.2e-17; Major facilitator superfamily
Orthology group: MCL25585; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203229-TA
ATGGATAAAGAGTGTGTGGGCTGGGGCTGGTGGCAGGCGCGCATTGCATTGCTACTGGCACTGCCAATTCTTCTCACCGGCATGTATGGCACCAACTATGTATTCCTAGCTGCGCGTACGCCGCACAGATGCCATATTCCAGAATGTGAAGATAATGCAACACTAACGGGGCCTGTGACACAAGAAGCGTGGTCACCATCTTGGTCTACCTGGGCATTACCACCAAATTCTGAATGCCATCGTCACCCAACACTCGTTGGAGTATGCAGCGAAGACGGTTTCGATAATAGCACTTTCTTGGATTGCGAAGAATTCGTGTATCAGCATCACACCAGTATTATGGCTGAGTTTAGTCTAGCTTGTCAAGAGTGGAAGCGGACTCTCGTTGGTACAGTGCATAATGTTGGCATGTTAGTTTCGTTGCCAATCATGGGCTACGTTTCTGATAGGTATGGTCGTCGAGCTGCTTTAGTGACTTGCGGCGTGGGTGCTGGTGTTGTGGGACTAATTAAATCCTTTGTCAACTCATATCACCTTTATCTAGCCTCAGAATTTTTTGAAACTGTTTTAGGTGCAAGCGTATACCCAGCAGCTTTTGTACTTATGATTGAATGGCTGGGCGTAGAACACCGAATTCTGGCCAGTTTGCTTCTTGGAATTCCTTTATCACTTGGGGCAGCTTCCCTAGCCCTTTTAGACTATTTGACTGCTTATTGGCGAACTTGGGCACGATTTGCATACCCGCCTTCCTTTTTACTACTTTTATATCCATGGGTGTTACCAGAGAGTGTAAGATGGCTTGTAGCTGGAGGGAAAGTCAAAGAAGCAGCTCGCGTTATAAAACAAGCGGCAAGAGCTAACAATGTTTCGTTACCTGAAGGGACACTTGATAAAATGTTGTCGACAGATCAAGTTACCGATAAAGTTTTAAATATTGTAGAAGAAGAAAGTATATTAAGAGCCTTTATTAAGTACAGCGCATTACGACGTCGTTTGCTCGTCTGTTTTGCTTGGTGGTTGTGTGCAGTATTCGTATTTTACGGGTTAGCAGTTCGTTCTCATGCCCTTGCTGGTTCCGCACATGCTAATTACGTGTTGCTCGCCGCTGCAGAGTTGCCAGCGCTGCTACTAAACACCCTGCTCTTAGATCGAGCTGGACGACGGCCTTTGCTCACCGCTGCTTTCTTGTTGACAGCTGTTGCGCTAATTGCAATACCATGTCTGCCTGACCGTGAGTATAGTAATCCATAG

Protein sequence:

>DPOGS203229-PA
MDKECVGWGWWQARIALLLALPILLTGMYGTNYVFLAARTPHRCHIPECEDNATLTGPVTQEAWSPSWSTWALPPNSECHRHPTLVGVCSEDGFDNSTFLDCEEFVYQHHTSIMAEFSLACQEWKRTLVGTVHNVGMLVSLPIMGYVSDRYGRRAALVTCGVGAGVVGLIKSFVNSYHLYLASEFFETVLGASVYPAAFVLMIEWLGVEHRILASLLLGIPLSLGAASLALLDYLTAYWRTWARFAYPPSFLLLLYPWVLPESVRWLVAGGKVKEAARVIKQAARANNVSLPEGTLDKMLSTDQVTDKVLNIVEEESILRAFIKYSALRRRLLVCFAWWLCAVFVFYGLAVRSHALAGSAHANYVLLAAAELPALLLNTLLLDRAGRRPLLTAAFLLTAVALIAIPCLPDREYSNP-
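
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the names `BASES`, `AMINO`, `CODON_TABLE`, and `translate` are illustrative and not part of the database. It uses short fragments copied from the start and end of DPOGS203229-TA rather than the full sequence. Note that the sketch writes a stop codon as `*`, whereas the protein record above marks it with `-`.

```python
# Standard genetic code (assumed here), laid out in the conventional
# T, C, A, G ordering for the first, second, and third codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon.

    Trailing bases that do not fill a complete codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Fragments copied from the first 21 and last 18 bases of the transcript.
first_bases = "ATGGATAAAGAGTGTGTGGGC"
last_bases = "GAGTATAGTAATCCATAG"

print(translate(first_bases))  # start of the protein record
print(translate(last_bases))   # end of the protein record, with the stop

# Length check: 416 residues plus one stop codon should account for
# every base of the 1251 bp transcript.
print(3 * (416 + 1) == 1251)
```

Passing the full nucleotide sequence to `translate` and comparing the result (with `*` replaced by `-`) against the protein record is the natural extension of this check.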