Monarch geneset OGS2.0

DPOGS203167
Transcript	DPOGS203167-TA	1128 bp
Protein	DPOGS203167-PA	375 aa
Genomic position	DPSCF300035	679340-681511
RNAseq coverage	469x (Rank: top 26%)
Annotation
Heliconius	HMEL015745	5e-173	75.73%
Bombyx	BGIBMGA011016-TA	0.0	83.47%
Drosophila	CG8654-PB	2e-93	46.42%
EBI UniRef50	UniRef50_Q7QDB3	1e-117	56.20%	AGAP003039-PA n=4 Tax=Culicidae RepID=Q7QDB3_ANOGA
NCBI RefSeq	XP_973659.1	2e-119	56.73%	PREDICTED: similar to organic cation transporter [Tribolium castaneum]
NCBI nr blastp	gi|91082535	4e-118	56.73%	PREDICTED: similar to organic cation transporter [Tribolium castaneum]
NCBI nr blastx	gi|91082535	2e-116	56.50%	PREDICTED: similar to organic cation transporter [Tribolium castaneum]
Group
Gene Ontology	GO:0055085	2.1e-30	transmembrane transport
	GO:0016021	2.1e-30	integral to membrane
	GO:0022857	2.1e-30	transmembrane transporter activity
KEGG pathway 
InterPro domain	[1-374]	IPR016196	4.1e-56	Major facilitator superfamily domain, general substrate transporter
	[1-364]	IPR005828	2.1e-30	General substrate transporter
Orthology group	MCL10381	Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203167-TA
ATGCTCGGAATTCTTGTGGGGAATATGATTTTTGGACACCTCTCTGATAGATTCGGGCGAAGATTACCATTCTTGATAGCAGTATTCCTTCAACTAGTAGCAGGAGTAACAACTGCATATTCAGTCAATTGGTATATGTTCACGGCTTTACGATTCCTATTGGCTGTGGCTACTGGGGGAACTATGGTCACAAGCTTTGTCCTTACTATGGAACTGATTGGCGCTAAATATAGAGATACAGTTGGGATTGTATATCAAATACCATTTAATCTTGGACACTTGTCACTTCCTTTATTTGGTTACTTCTTACGAGACTGGAGCTCGTTCCAACTGGCAATCTCTCTACCATCTGTTCTGTTTCTTAGCTATTACTTTTTACTTCCGGAATCACCGAGATGGTTAATGACTGTAGGAAGGAGCAAAGATGCTCTTAAAATTATGAAAGCAGCTGCTAAAAGGAATGGCCGTCCAACAGATAAAATCGAAGTTTCCATGCAGAAAATGGTAGTGGATGCTAGTTCTACCCAGGAACGTGCTTCATTTCTGGCTCTTTTCCGAACACCACGTCTTCGAGCTCGAACTATTGCTATTTGCTTTAATTGGTTTGTCTGCGGATTCTGTTTCTTTGGAGTGTCACAATACATCGGACACGTGTCCGGTAATATATTCTTCAACGTCGCAATCTCAGCTGCCATACTAGTACCAGGAACCCTTATCTCGATTTATACGAACAGAGTCCTTGGTAGAAAGATAACTCTTATCAGTTCAAACTGTGTGACGGGTCTTAGCTGTCTTCTCATTATCTTCGTGCCACAGTCTTTGGCCAGCCACTTGACCCTTGGATGCTTCGGGTTATTTGGAATGAGTATATCGTTCGCGACAGTTTATTTGTACGCAGGGGAATTATTTCCAACTGTTGTACGAAATTCCGGCGTCGGGCTATCATCCACCGTGGCAAGGATAGGCTCTATGGTAGCTCCATTTGTGGCAACTCTTGCTCATACCAGTGCCATTTTGCCACCTCTACTTTTTGGTATAGTACCTCTCATAGGTTCCTGCCTATGTATTATATTACCAGATACTAGAGGAAAGAAATTACCCGATACTTTGGAGGAGGGCGAAGTGTAA

Protein sequence:

>DPOGS203167-PA
MLGILVGNMIFGHLSDRFGRRLPFLIAVFLQLVAGVTTAYSVNWYMFTALRFLLAVATGGTMVTSFVLTMELIGAKYRDTVGIVYQIPFNLGHLSLPLFGYFLRDWSSFQLAISLPSVLFLSYYFLLPESPRWLMTVGRSKDALKIMKAAAKRNGRPTDKIEVSMQKMVVDASSTQERASFLALFRTPRLRARTIAICFNWFVCGFCFFGVSQYIGHVSGNIFFNVAISAAILVPGTLISIYTNRVLGRKITLISSNCVTGLSCLLIIFVPQSLASHLTLGCFGLFGMSISFATVYLYAGELFPTVVRNSGVGLSSTVARIGSMVAPFVATLAHTSAILPPLLFGIVPLIGSCLCIILPDTRGKKLPDTLEEGEV-
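
The lengths listed in the record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the 1128 bp transcript is a complete coding sequence ending in a stop codon (the nucleotide sequence above does start with ATG and end with TAA), and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical helpers, not part of any database API.

```python
def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS that ends in a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_bp // 3 - 1


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 1128
protein_aa = 375
genomic_bp = span_length(679340, 681511)

# The transcript and protein lengths agree under the stated assumption.
lengths_consistent = expected_protein_length(transcript_bp) == protein_aa

# Genomic sequence not represented in the transcript (possibly intronic).
non_transcript_bp = genomic_bp - transcript_bp
```

Under these assumptions the transcript and protein lengths agree, and the genomic span is longer than the transcript, which would be consistent with the gene model containing introns.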