Monarch geneset OGS2.0

DPOGS204747
TranscriptDPOGS204747-TA1233 bp
ProteinDPOGS204747-PA410 aa
Genomic positionDPSCF300231 - 426818-429942
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0104572e-16065.74% 
BombyxBGIBMGA013710-TA4e-12756.38% 
DrosophilaCG7442-PA2e-6836.18% 
EBI UniRef50UniRef50_B0XDF03e-6837.22%Organic cation transporter n=5 Tax=Culicidae RepID=B0XDF0_CULQU
NCBI RefSeqXP_001867673.14e-7438.37%organic cation transporter [Culex quinquefasciatus]
NCBI nr blastpgi|1700647938e-7338.37%organic cation transporter [Culex quinquefasciatus]
NCBI nr blastxgi|1571209651e-7538.50%organic cation transporter [Aedes aegypti]
Group
Gene OntologyGO:00550852.1e-22transmembrane transport
GO:00160212.1e-22integral to membrane
KEGG pathway 
InterPro domain[19-398] IPR0161961.7e-48Major facilitator superfamily domain, general substrate transporter
[20-351] IPR0117012.1e-22Major facilitator superfamily
Orthology groupMCL21023 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204747-TA
ATGTTTCTAGATGTAAGGATGGCCTCGAAGTTGGCTTGTCAGGAATGGAAGAGAACCTTGGTTGGAACAATACACAATGCTGGTTACATGTGCGGCCTGTGGCTAGTTGGGCCCATGTCTGATCGATTAGGAAGAAAAACTATAGCAATCGCTACATCGGTTTTGGGAGCATTATTTGGGACTTTGAGAAGTTTCTCATACTCGTACTGGTTTTATCTAGTTATGGAATTCTTAGAAGGAGCTCTTGGTGACTCTTTTTCACCATTTTATATTTTGGCGTTGGAGTTGGTATCACCAAAAAAAAGGATTCCGTTTTACATGTACTGCAGTTTTGGTTACTGTATTGGAGGAATTGTTGTGGCCCTTATAGCATGGGTGACACCAAATTGGAGATGGTTTCTGAGAGCGATTTACTTACCATCGTTCCTTTTTATATTTTATTCGTTATTATTAGACGAAAGCCCTCGTTGGCTTTACACTAAAGGTCGTAGAGATAAAGCTGAAAAGATTCTAGAAAATGCTAGCAAGAAAAATAAAATAGAGCTAGACAAGCAGGTTCTTGATAAGCTTTCTTGTAAAGTTAGCCCTAATGTGACTTTTAGTGAATTACTTAAAAGTACGTTCAAATCGAAACTTCTAAGGTGCCGTCTCTTAGTGTGCCTGGTCTGGTGGATAACATCTGCTCTCGTTAACTACGGTCTCTTAATAAATTCCGTTTCATTACAAGGGAACAAATATATAATTTTTGGTGTAATGTATTTGATTGATATTCCTGGAATTTTAATCTTTAGTTACATATGCAAAAAATTTAAAAGAAAACGTCCTTTGATGATATCATTTTTAGCAGGAGGAGTTTTTAGTATACTGCAACCGTTTGTACAAACGAATTTTCCTTGGGTCTCTTTAATATTTTATATGACGGCCAAGATGATGTCCATGCTTTACTTCAACCTCACATATCTTTACACATTAGAGCTGTTTCCAACATATACTCGGAACTCTATGCACGCGTTGTGCTCCTCCTTCGGCCGCCTTGGGTCAACGCTAGCGCCGCAAACTCCGTTACTTGCAATGTATTGGAAAGGTCTCCCATCACTTATATTTGGTCTCGCGGCAGTGATCGCTGCTCTGGTCACGTGTTTCGTACCCGACGTATCCAATGAATCGCTTCCGGATACAGTCGAGCAGGCTGAAGGAATGGGAACAACTAAGAAGACTTATATACAAGGATGA

Protein sequence:

>DPOGS204747-PA
MFLDVRMASKLACQEWKRTLVGTIHNAGYMCGLWLVGPMSDRLGRKTIAIATSVLGALFGTLRSFSYSYWFYLVMEFLEGALGDSFSPFYILALELVSPKKRIPFYMYCSFGYCIGGIVVALIAWVTPNWRWFLRAIYLPSFLFIFYSLLLDESPRWLYTKGRRDKAEKILENASKKNKIELDKQVLDKLSCKVSPNVTFSELLKSTFKSKLLRCRLLVCLVWWITSALVNYGLLINSVSLQGNKYIIFGVMYLIDIPGILIFSYICKKFKRKRPLMISFLAGGVFSILQPFVQTNFPWVSLIFYMTAKMMSMLYFNLTYLYTLELFPTYTRNSMHALCSSFGRLGSTLAPQTPLLAMYWKGLPSLIFGLAAVIAALVTCFVPDVSNESLPDTVEQAEGMGTTKKTYIQG-