Monarch geneset OGS2.0

DPOGS204748
TranscriptDPOGS204748-TA1011 bp
ProteinDPOGS204748-PA336 aa
Genomic positionDPSCF300231 - 422342-423959
RNAseq coverage0x (Rank: top 96%)
Annotation
HeliconiusHMEL0104571e-13064.76% 
BombyxBGIBMGA013710-TA9e-11356.80% 
DrosophilaCG7442-PA5e-5434.82% 
EBI UniRef50UniRef50_B0XDF08e-5436.26%Organic cation transporter n=5 Tax=Culicidae RepID=B0XDF0_CULQU
NCBI RefSeqXP_001867673.12e-5837.61%organic cation transporter [Culex quinquefasciatus]
NCBI nr blastpgi|1700647933e-5737.61%organic cation transporter [Culex quinquefasciatus]
NCBI nr blastxgi|1571209652e-6037.94%organic cation transporter [Aedes aegypti]
Group
Gene OntologyGO:00550859.3e-14transmembrane transport
GO:00160219.3e-14integral to membrane
GO:00228579.3e-14transmembrane transporter activity
KEGG pathway 
InterPro domain[12-324] IPR0161966.1e-35Major facilitator superfamily domain, general substrate transporter
[35-312] IPR0058289.3e-14General substrate transporter
Orthology groupMCL21023 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204748-TA
ATGGAATTCTTAGAAGGAGCTCTTGGTGACTCTTTTTCACCATTTTATATTTTGGCGTTGGAGTTGGTATCACCAAAAAAAAGGATTCCGTTTTACATGTACTGCAGTTTTGGTTACTGTATTGGAGGAATTGTTGTGGCCCTTATAGCATGGGTGACACCAAATTGGAGATGGTTTCTGAGAGCGATTTACTTACCATCGTTCCTTTTTATATTTTATTCGTTATTATTAGACGAAAGCCCTCGTTGGCTTTACACTAAAGGTCGTAGAGATAAAGCTGAAAAGATTCTAGAAAATGCTAGCAAGAAAAATAAAATAGAGCTAGACAAGCAGGTTCTTGATAAGCTTTCTTGTAAAGTTAGCCCTAATGTGACTTTTAGTGAATTACTTAAAAGTACGTTCAAATCGAAACTTCTAAGGTGCCGTCTCTTAGTGTGCCTGGTCTGGTGGATAACATCTGCTCTCGTTAACTACGGTCTCTTAATAAATTCCGTTTCATTACAAGGGAACAAATATATAATTTTTGGTGTAATGTATTTGATTGATATTCCTGGAATTTTAATCTTTAGTTACATATGCAAAAAATTTAAAAGAAAACGTCCTTTGATGATATCATTTTTAGCAGGAGGAGTTTTTAGTATACTGCAACCGTTTGTACAAACGAATTTTCCTTGGGTCTCTTTAATATTTTATATGACGGCCAAGATGATGTCCATGCTTTACTTCAACCTCACATATCTTTACACATTAGAGCTGTTTCCAACATATACTCGGAACTCGATGCACGCGTTGTGCTCCTCCTTCGGCCGCCTTGGGTCAACGCTAGCGCCGCAAACTCCGTTACTTGCAATGTATTGGAAAGGTCTCCCATCACTTATATTTGGTCTCGCGGCAGTGATCGCTGCTCTGGTCACGTGTTTCGTACCCGACGTATCCAATGAATCGCTTCCGGATACAGTCGAGCAGGCTGAAGGAATGGGAACAACTAAGAAGACTTATATACAAGGATGA

Protein sequence:

>DPOGS204748-PA
MEFLEGALGDSFSPFYILALELVSPKKRIPFYMYCSFGYCIGGIVVALIAWVTPNWRWFLRAIYLPSFLFIFYSLLLDESPRWLYTKGRRDKAEKILENASKKNKIELDKQVLDKLSCKVSPNVTFSELLKSTFKSKLLRCRLLVCLVWWITSALVNYGLLINSVSLQGNKYIIFGVMYLIDIPGILIFSYICKKFKRKRPLMISFLAGGVFSILQPFVQTNFPWVSLIFYMTAKMMSMLYFNLTYLYTLELFPTYTRNSMHALCSSFGRLGSTLAPQTPLLAMYWKGLPSLIFGLAAVIAALVTCFVPDVSNESLPDTVEQAEGMGTTKKTYIQG-