Monarch geneset OGS2.0

DPOGS209194
Transcript: DPOGS209194-TA (1608 bp)
Protein: DPOGS209194-PA (535 aa)
Genomic position: DPSCF300061 + 607562-613713
RNAseq coverage: 292x (Rank: top 38%)
Annotation
Heliconius: HMEL003249; 1e-87; 35.26%
Bombyx: BGIBMGA009197-TA; 0.0; 61.83%
Drosophila: CG7458-PA; 7e-70; 31.58%
EBI UniRef50: UniRef50_E2A7K4; 2e-81; 32.90%; Solute carrier family 22 member 21 n=7 Tax=Formicidae RepID=E2A7K4_CAMFO
NCBI RefSeq: XP_391853.2; 5e-81; 34.46%; PREDICTED: similar to CG7442-PA [Apis mellifera]
NCBI nr blastp: gi|383863003; 1e-83; 34.07%; PREDICTED: solute carrier family 22 member 21-like [Megachile rotundata]
NCBI nr blastx: gi|383863003; 4e-84; 34.76%; PREDICTED: solute carrier family 22 member 21-like [Megachile rotundata]
Group
Gene Ontology: GO:0055085; 6.6e-22; transmembrane transport
    GO:0016021; 6.6e-22; integral to membrane
    GO:0022857; 6.6e-22; transmembrane transporter activity
KEGG pathway:
InterPro domain: [135-527] IPR016196; 4e-37; Major facilitator superfamily domain, general substrate transporter
    [134-490] IPR005828; 6.6e-22; General substrate transporter
Orthology group: MCL30640; Lepidoptera specific

Nucleotide sequence:

>DPOGS209194-TA
ATGTCGGCGCCTGTTGAAGAGGTTCAGTTTAACCTGGACGGTATCCTCAGTGAGCTGGGAAGCTTTGGCAAGTACCAGCTGTTTCTGCTCGTACTGCTGGCTTTCAGGGACTCATTCCTTCACATGTGCAACTTCAATTACGTCTTTACTGCCGCGGACGTTAATTTCAGATGTCACGTGCCAGAGTGCGAGCTATCTCCACCTATATTCAACACTTCATGGAGCGACTACGCCACTCACAAATCCGTATGCCTCCGCTCAGTTACTGAGCATTCAAACATAACTGAGTGTACGCCGGATATATTCTTGGAGAACAAAACTATGCCTTGTGATGACGTTATATATGAGAATTACAACAGTATTGTTGCGGAGTTCAACTTGGGATGTCAGCCTTGGAAAAGGACTTTAATAGGGACCGTACATAATCTGGGACTTGTATTTTCCTACATTATTTCGGGGATAATTTCGGACAGGTATGGTCGTAAGGTGATCATAGTTGGTACACCGCTGATGGTGGGTATCGCTGGCTTACTCAAATCATTCACCTTCGACTACTGGTCGCTACTCGTGCTCGAGTTTCTTGAAACGGCGCTGGGTTACGGAAACGCTTCTATGGTGTTATCCCTGGAGACAGTCAGTCAGAAAAGTCGGGTAGCGTTCTCATGCATTTCGGACATTCTATCAAGTCTTGGCGCAGCGCTTTTCGGTCTCATAGCTTGGAAAATTCCTTACTGGAGATACATGATGAGAGCTATTTATGCTCCATTACTGATTGTGGTATTTTATATTTTCCTCGTCGATGAGGGAGTGCGGTGGCTCTTAGCGCACAACAGAAAAGAAGAGGCTGTCCGCGTCTTAAACAAGGTGGCTAAAATAAATAACATCACTCTCTCAAGCAAAGCCAAGGAAATGCTATCGACCATATCCAACGAGCAGAACAGGATGGTTGAGATGAATCAGCAGTCTAAGGACGAGCCGGTCGAGATCCTCGCAGTCCTCCGATCAAAGACCATCCTCGTACGTTTGTCAGTGATAGCTGCTTGCTTCATATGTAGCATGTTCGTGTTCAGCGGGGCCATAATATATTCCACTAATATATCTGGAAACAAATATATGAACTATTCAATGATGATACTCTTAGGAGTTCCAACCAGAATCATCACAGCCTTCACGTTGACCAGGTTCGGAAGGAAGGCTCCTATATGCATCGCGTACTGTCTCTGCTCGTGTTTCTTCATGGCGTCCGCTTTCGTGCCCAAATCAGTAGCATGGGCTTCCACTATCCTGTACATGATCGGCAAGATGTGCAGCAGTTACGGAATCTTCTCCGTGTATGTGGTCGGCATGGAGGTATTTCCGACTACGTCCAGGAACTCTCTAGTCAATATCGCCAACACCGTCGGCAGGATCGGCTCTGTTATTGCTCCACAGACGCCCTTACTGGAAAAGTACATGCCCGGTCTGCCTGTGGTTGTGTTTTCATTGGCTGCGCTGGGACCCGCGCTGCTCGCGCTCCTACTGCCGGACACGAGCGCCGCTCTGCCTGACAGACTCAAGGACCTCCAGGTACAGGAGAAGGATGGCGGTGTAACCAGCGCCAGTGCGTAG

Protein sequence:

>DPOGS209194-PA
MSAPVEEVQFNLDGILSELGSFGKYQLFLLVLLAFRDSFLHMCNFNYVFTAADVNFRCHVPECELSPPIFNTSWSDYATHKSVCLRSVTEHSNITECTPDIFLENKTMPCDDVIYENYNSIVAEFNLGCQPWKRTLIGTVHNLGLVFSYIISGIISDRYGRKVIIVGTPLMVGIAGLLKSFTFDYWSLLVLEFLETALGYGNASMVLSLETVSQKSRVAFSCISDILSSLGAALFGLIAWKIPYWRYMMRAIYAPLLIVVFYIFLVDEGVRWLLAHNRKEEAVRVLNKVAKINNITLSSKAKEMLSTISNEQNRMVEMNQQSKDEPVEILAVLRSKTILVRLSVIAACFICSMFVFSGAIIYSTNISGNKYMNYSMMILLGVPTRIITAFTLTRFGRKAPICIAYCLCSCFFMASAFVPKSVAWASTILYMIGKMCSSYGIFSVYVVGMEVFPTTSRNSLVNIANTVGRIGSVIAPQTPLLEKYMPGLPVVVFSLAALGPALLALLLPDTSAALPDRLKDLQVQEKDGGVTSASA-
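
The lengths reported in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, it assumes the transcript record is a complete coding sequence (it begins with ATG and ends with TAG), that the trailing "-" in the protein record marks the stop codon, and that the genomic coordinates are 1-based and inclusive. None of these conventions is stated in the record itself.

```python
def cds_length_matches(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Return True if a CDS of transcript_bp bases encodes protein_aa residues.

    Assumes the transcript is coding sequence only; has_stop says whether
    the final codon is a stop codon that is not counted as a residue.
    """
    if transcript_bp % 3 != 0:
        return False  # a complete CDS is a whole number of codons
    codons = transcript_bp // 3
    return codons == protein_aa + (1 if has_stop else 0)


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
print(cds_length_matches(1608, 535))   # True: 536 codons = 535 residues + stop
print(span_length(607562, 613713))     # 6152
```

Under these assumptions the genomic span (6152 bp) is longer than the 1608 bp transcript, which would be consistent with the gene containing introns, although the record does not list the exon structure.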