Monarch geneset OGS2.0

DPOGS211219
Transcript          DPOGS211219-TA    1461 bp
Protein             DPOGS211219-PA    486 aa
Genomic position    DPSCF300007 + 1128972-1132244
RNAseq coverage     879x (Rank: top 14%)
Annotation
Heliconius          HMEL012483          0.0       82.34%
Bombyx              BGIBMGA003198-TA    0.0       75.71%
Drosophila          CG8602-PA           1e-148    59.38%
EBI UniRef50        UniRef50_E1JHT4     5e-142    57.35%    CG12194, isoform B n=43 Tax=Neoptera RepID=E1JHT4_DROME
NCBI RefSeq         XP_001651759.1      5e-160    58.87%    hypothetical protein AaeL_AAEL006002 [Aedes aegypti]
NCBI nr blastp      gi|157111941        1e-158    58.87%    hypothetical protein AaeL_AAEL006002 [Aedes aegypti]
NCBI nr blastx      gi|157111941        2e-158    58.87%    hypothetical protein AaeL_AAEL006002 [Aedes aegypti]
Group
Gene Ontology       GO:0055085    1.7e-18    transmembrane transport
                    GO:0016021    1.7e-18    integral to membrane
KEGG pathway
InterPro domain     [1-389]    IPR016196    1.7e-47    Major facilitator superfamily domain, general substrate transporter
                    [6-237]    IPR011701    1.7e-18    Major facilitator superfamily
Orthology group     MCL11183    Multiple-copy universal gene

Nucleotide sequence:

>DPOGS211219-TA
ATGTGTTTCCTTTGTTTTGGTTCATATTTCTGTTATGATACACCGGGTGCTCTGGCCGACAACTTCAAGGGTGACTCACATCTAAACACATCCCAGTTTGCGCTGCTCTACTCTATATACTCATGGCCTAATGTAATATTATGCTTTATTGGCGGTTATCTCATTGATAGATATTTCGGTGTGAGATTGGGTACAATAATTTATATGACCATAGTGTTCATTGGAGCAGTTGTATTCGCTTTTGGTGTTTACATTAATCAATTTTGGCTTATGATACTTGGAAGATTTATATTTGGTATTGGAGGAGAGTCTTTGCAAGTAGCGGTTAACAACTATGTAGTTCTTTGGTTCAATGGAAAGGAACTTAATATGGTGTTTGGACTCCAATTGTCGTTCTCAAGATTTGGAAGCACTGTGAATTTTTGGGTCATGGAACCTATTTACAAATGGGTGGCCACCTATTATGCCGGCTATGAGAAACTTGGTGTAACTTTGTTTATAGCCTCATTAACCTGTCTCGGTTCTCTCATTTGTGGTCTGATTCTGGGTTGGATGGACCATAGAGCTGAGAAAATGTTAAATAGACAGGAAGCTCAAGCTAAGGACGAACCATTCAGACTCATTGACATATTTAACTTCAAGCCAGTGTACTGGCTTGTGTGTGTTATATGTGTGGCTTATTATTTGGCTATATTCCCATTTATTGCATTGGGAAAGATGTTTTTTGAAAGGAAATTTGACTTTATGCCTCAAGATGCTAACACTGTCAATTCCATGGTATACCTACTGTCTGCGGCGCTCAGTCCATTTTTTGGTATCCTCATAGATAAGACTGGTAGGAATGTGACTTGGGTCATACTCAGTATAGTAACAACTATCGGATCACATTTTCTTTTGGCCTTCACATTTATCAATCCGTATGTTGGAGTGATGTCCTTAGGAATTTCCTATTCGCTGCTAGCCAGTGGTCTGTGGCCGCTCATTGCCATGATTGTACCTGAGAACCAGTTGGGAACAGCATATGGGATATCTCAAGCCGTCCAAAACATGGGTTTGGCCACAGTTCTCATTTTAGCTGGAATCATTGTTGATAAATATGGATATTTGATGTTAGAAATGTTTTTCCTGGGATGTCTTTTTATCTCTTTGATAGCTGGTGTTGTTATATATATAGTGGATTCAGCGAACAATGGTATCCTCAATCTCTCTCCGAGTGTCAGGGAAGCTTATTTAAATAAATCAGCGAACCGTGCTGAAGAGACAGCCAATCTCCTGGATCACGTGGACAGCTCGGATCAAGAGGAAGTTGACTCGCGATTGACCAATCAACACAGTTCCCAAAACGCTGGAATCGAATCGTCCGGGGATTTTCCGCGGGACCCTGCGCAGGCGCATTCCGACAGAATACGATCTAGATACATATCGCAACTACTGCCCTCAACCATTCCAGACAGATCCTAA

Protein sequence:

>DPOGS211219-PA
MCFLCFGSYFCYDTPGALADNFKGDSHLNTSQFALLYSIYSWPNVILCFIGGYLIDRYFGVRLGTIIYMTIVFIGAVVFAFGVYINQFWLMILGRFIFGIGGESLQVAVNNYVVLWFNGKELNMVFGLQLSFSRFGSTVNFWVMEPIYKWVATYYAGYEKLGVTLFIASLTCLGSLICGLILGWMDHRAEKMLNRQEAQAKDEPFRLIDIFNFKPVYWLVCVICVAYYLAIFPFIALGKMFFERKFDFMPQDANTVNSMVYLLSAALSPFFGILIDKTGRNVTWVILSIVTTIGSHFLLAFTFINPYVGVMSLGISYSLLASGLWPLIAMIVPENQLGTAYGISQAVQNMGLATVLILAGIIVDKYGYLMLEMFFLGCLFISLIAGVVIYIVDSANNGILNLSPSVREAYLNKSANRAEETANLLDHVDSSDQEEVDSRLTNQHSSQNAGIESSGDFPRDPAQAHSDRIRSRYISQLLPSTIPDRS-
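
Consistency check (illustrative sketch, not part of the original record): the 1461 bp transcript corresponds to 487 codons, that is, 486 amino acids plus one stop codon, which agrees with the 486 aa protein length listed above. The Python snippet below translates a coding sequence with the standard genetic code to show how the two sequences relate. The function name `translate` and the choice to print the stop codon as "-" (inferred from the trailing "-" in the protein sequence above) are assumptions made for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence; stop codons are written as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 15 nucleotides of DPOGS211219-TA, copied from the record.
print(translate("ATGTGTTTCCTTTGTTTTGGTTCATATTTC"))  # MCFLCFGSYF
print(translate("CCAGACAGATCCTAA"))                 # PDRS-

# Length check: 1461 bp -> 487 codons -> 486 residues plus the stop codon.
print(1461 // 3 - 1)                                # 486
```

Passing the full DPOGS211219-TA sequence to `translate` should give the DPOGS211219-PA sequence, including the final "-".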