Monarch geneset OGS2.0

DPOGS210498
Transcript        DPOGS210498-TA  1533 bp
Protein           DPOGS210498-PA  510 aa
Genomic position  DPSCF300186 - 66530-70146
RNAseq coverage   69x (Rank: top 66%)
Annotation
Heliconius        HMEL022265        0.0     65.17%
Bombyx            BGIBMGA012587-TA  3e-149  69.01%
Drosophila        CG7442-PA         1e-81   34.52%
EBI UniRef50      UniRef50_E2A7K4   3e-86   34.76%  Solute carrier family 22 member 21 n=7 Tax=Formicidae RepID=E2A7K4_CAMFO
NCBI RefSeq       XP_391853.2       2e-87   36.36%  PREDICTED: similar to CG7442-PA [Apis mellifera]
NCBI nr blastp    gi|380015366      1e-88   36.44%  PREDICTED: solute carrier family 22 member 21-like [Apis florea]
NCBI nr blastx    gi|328782698      5e-90   35.78%  PREDICTED: solute carrier family 22 member 21-like [Apis mellifera]
Group
Gene Ontology     GO:0055085  4.7e-29  transmembrane transport
                  GO:0016021  4.7e-29  integral to membrane
                  GO:0022857  4.7e-29  transmembrane transporter activity
KEGG pathway 
InterPro domain   [105-479]  IPR016196  6.4e-36  Major facilitator superfamily domain, general substrate transporter
                  [105-474]  IPR005828  4.7e-29  General substrate transporter
Orthology group   MCL25982  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210498-TA
ATGCTGGCCCTCGTCTACTCTACCAACTCCATGTATATCGTGAACTACGTCTTCGCGGTTGAAGATGTCAGTTACAGGTGTAAGGTCCCGGAATGCGAGAGCGGCAGCAGTTTCTCCGTCCCCTGGCTGAACGCTTCGAGTTTGGATTCGTTTGAGGTCGGTGTGAAGCAATGTCACCGCAGTCCACCGCTCAACGGACGCTGTACACACTTCAATCAAACTGAACTGATGAGATGCGATGAGTGGGTGTACGAGGTCCCCGACAGCTTCGTGGCCGAATTTGGTTTAGCCTGCCAGGACTGGAAACCCCCTTTAGTCGGGACCATACATAGCCTCGGATGCCTCATAGGACAAATAATACAAGGACAAATATCAGACAGGTTCGGTCGTAAGACAGCTGCAGTATTCTCGGGCACAATGGGGGCTGTACTCGGTCTATCCAAAAGTTTTGCCTCGTCTTTCTGGGTGTATCTCGCGCTGGAGGGCCTGGAGGCTACTATCGGAGATGCCTTATCTCCTATGTTTATGTTAAGCATCGAGATCGTAGATAAGCAGCGTGCTGTGTTATATCAAATGATATTACTGAATTTCTACACTATCGGTCAAATTGTAATGTCATTTGTCGCCTGGGCGGTGCCATACTGGAGGAACTTCCTCCGTGTGATCTACGCGCCGACTCTCCTCATAATTACATACTCATTCTTTTTGGATGAGAGTATTCGATGGCTTTTTAGTAAAGGACAAAAAGAGAGAGCTATCCGATTAATAGAAAAAATAGCAAAAAGAAACAATGTACAGATTGACCGAAATATGATTAACAAACTTGAGTATACGGATGAAAAAACTTCAAGCAAAGCAGACAGGAAGTTGCTGTTAAAGACTTTTAAATCACAAATAATGATGCGAAGGTTCCTCGTGTGTCTCGTCTGGTGGTTCACGATCACTCTCATCAACTACGGTATGATGATCAGCTCGGTTCTCATCAACGGCAACAAGTACTTGAACTTCGCTCTCCTTATAATGATGGACATTCCGTCCAATATCTTCTATTGGTTAGCTTTGTCAAAGTATAAAAGAAAGATCCCGCTGATGGGATCGTTCGTCATGGGTGGGATTTTTTGTATCTCCCAACCTTTTGTTCCTAAAGACCTGGCGTGGATGGGCTTGGCTCTTTTTATGTTATTCGAGATGCTGGCCACCTTCTCTTACAACATTGTGTACATGTACACGTCCGAGCTCTTCCCGACTTACACCAGGAACTCCATGCACTCCATTTGCTCCGCCATAGGACGAGTAGGATCCCTGATTGCGCCTCAGACACCCCTCTTGATGACCTACTGGTCAGGTTTGCCCGCGCTCCTCTTCGGTCTGTCGTCCCTCGTTTCTGGAGCCCTGACTATCTTCATGCCGGAGACAGCGTGCACTCAGCTACCTGACACGGTGAGGGAAGCTGAGGCCCTCGGGAGGAAAACAAATAAGAAGAGATCACAGATGCATTTCGACCAGACAGAACAAATGTTGAAAGCGTCATAA

Protein sequence:

>DPOGS210498-PA
MLALVYSTNSMYIVNYVFAVEDVSYRCKVPECESGSSFSVPWLNASSLDSFEVGVKQCHRSPPLNGRCTHFNQTELMRCDEWVYEVPDSFVAEFGLACQDWKPPLVGTIHSLGCLIGQIIQGQISDRFGRKTAAVFSGTMGAVLGLSKSFASSFWVYLALEGLEATIGDALSPMFMLSIEIVDKQRAVLYQMILLNFYTIGQIVMSFVAWAVPYWRNFLRVIYAPTLLIITYSFFLDESIRWLFSKGQKERAIRLIEKIAKRNNVQIDRNMINKLEYTDEKTSSKADRKLLLKTFKSQIMMRRFLVCLVWWFTITLINYGMMISSVLINGNKYLNFALLIMMDIPSNIFYWLALSKYKRKIPLMGSFVMGGIFCISQPFVPKDLAWMGLALFMLFEMLATFSYNIVYMYTSELFPTYTRNSMHSICSAIGRVGSLIAPQTPLLMTYWSGLPALLFGLSSLVSGALTIFMPETACTQLPDTVREAEALGRKTNKKRSQMHFDQTEQMLKAS-
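
The transcript and protein lengths listed in this record can be cross-checked against each other: a coding sequence for 510 residues plus one stop codon should span (510 + 1) x 3 = 1533 bp. The following is a minimal Python sketch of that check, not part of the database or its tooling. The function names parse_fasta, expected_cds_length and lengths_consistent are hypothetical, and the sketch assumes that a trailing "-" or "*" in the protein record marks the stop codon.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a dict mapping record ID to sequence."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record ID is the first whitespace-delimited token after '>'.
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def expected_cds_length(protein_aa, include_stop=True):
    """Return the coding-sequence length in bp for a protein of protein_aa residues."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


def lengths_consistent(nucleotide_seq, protein_seq):
    """Check that a CDS (including its stop codon) matches the protein length.

    Assumption: a trailing '-' or '*' in the protein string denotes the stop
    codon and is not counted as a residue.
    """
    residues = len(protein_seq.rstrip("-*"))
    return len(nucleotide_seq) == expected_cds_length(residues)
```

With the values from this record, expected_cds_length(510) gives 1533, which matches the listed transcript length. Passing the two sequences above to lengths_consistent would run the same check on the actual records.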