Monarch geneset OGS2.0

DPOGS201205
Transcript: DPOGS201205-TA, 1311 bp
Protein: DPOGS201205-PA, 436 aa
Genomic position: DPSCF300262 + 618022-621608
RNAseq coverage: 163x (Rank: top 51%)
Annotation
Heliconius: HMEL015908; 0.0; 78.28%
Bombyx: BGIBMGA014248-TA; 6e-131; 70.78%
Drosophila: CG15096-PA; 2e-111; 46.21%
EBI UniRef50: UniRef50_D6WYK7; 9e-111; 49.76%; Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6WYK7_TRICA
NCBI RefSeq: XP_319685.4; 5e-115; 47.78%; AGAP008931-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|289743127; 5e-117; 46.77%; permease of the major facilitator superfamily [Glossina morsitans morsitans]
NCBI nr blastx: gi|289743127; 2e-118; 47.76%; permease of the major facilitator superfamily [Glossina morsitans morsitans]
Group
Gene Ontology: GO:0055085; 5.2e-49; transmembrane transport
Gene Ontology: GO:0016021; 5.2e-49; integral to membrane
KEGG pathway:
InterPro domain: [8-425]; IPR016196; 1e-74; Major facilitator superfamily domain, general substrate transporter
InterPro domain: [7-365]; IPR011701; 5.2e-49; Major facilitator superfamily
Orthology group: MCL31074; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201205-TA
ATGAGCGTCACTATTCTTGCCATGACAAAAGAAAATAACTACGGATATCAGGTGTTCGATTGGAATAAAACAATTCAAGATACGATTCTATCTTCCTTTTTTTGGGGCTACATAGTTCTACAGATTCCAGCAGGTGTATTTGCTGGTCGTTTCGGTGGCAAGATGCTGATACTTGTGTCTATGATAGTGACCGGCTTGATTAACTTGATTGTACCGATCTCCGCAGTGAAGGGAGATTGGATGGCAGTCTGCGGTTGTAGGATAGGTATGGGTCTTTTCCAAGGAATATTGTATCCAAGTCTTCATGGACTTCTGGGACAATGGGCTCCGATTACAGAGCGAAGTCGTATGGGCACCTTCGTATATTCTGGATCACAATTGGGAACCGTTATCGAGATGATGTTGGCTGGTGTTTTGTCGGATAGTAAGTTTGGTTGGCCGTCTGTTTATTACGTTGCTGGCATTACCTGCATTTTATGGTCAGTTCTGTGGATCATATTCGGAGCTTCTAACCCCGCTGAAAGCAAATGGATCTCTAAAGAGGAACGGAAGTATATTGAAAGCAACGCAGGATCCTCTAACGCTAACGAAAATAAAAAAATTCCAGTACCATGGAAAAGTATTTTAACTTCATTGCCGTTTTGGTCAATTTTACTTTCTCATTGTGGCCAAAGCCTCGGCTTTTGGACACTGCTAACAGAAATGCCGTCTTATATGGATAAAGTTCTCGGAGTAAATATAAAAAGTACTGGTTACCTTTCGGCGTTACCCTACGTGGCCATGTATATCCTCAGCTTTGTATTCAGTTGGATTGCCGAATATCTGGTCAACAATAACGTCACCTCTCTCTCTTCCACGCGGAAGATCTTCAATACGATCGCATTTTGGGGTCCAGCCGCTTCACTGCTGGCTCTATGCTACATTCCCGCGGGCCACCTCACACTTGCTGTGGTTATGCTTACAATAACTGTTGGACTTAATGGAGCTCACTATGTTGGCTTTCTGATTTCTCATATTGACTTATCGCCAAACTTCGCTAGTACCTTGATGGGTATCACAAATGGCTGCGGAAACATATTCTCTATAATGGCACCTTTAAGTGTGTCAGTCGTTGTCTCGGATGAGAAAAGTGCAGCTGACTGGAGGAAAGTGTTCTTTATATCTATAGCCTTTTATTTTCTCAGCAATCTCTTTTTCATTCTCTTTATGTCTGGGAATGTTCAAGATTGGAATGAACCACAAAGCAAAAACACTATCGAGGAAGGAGAGAAAAAATCTAAAAAGGAGAAAACTGGAGAACACAAGTTTTAA

Protein sequence:

>DPOGS201205-PA
MSVTILAMTKENNYGYQVFDWNKTIQDTILSSFFWGYIVLQIPAGVFAGRFGGKMLILVSMIVTGLINLIVPISAVKGDWMAVCGCRIGMGLFQGILYPSLHGLLGQWAPITERSRMGTFVYSGSQLGTVIEMMLAGVLSDSKFGWPSVYYVAGITCILWSVLWIIFGASNPAESKWISKEERKYIESNAGSSNANENKKIPVPWKSILTSLPFWSILLSHCGQSLGFWTLLTEMPSYMDKVLGVNIKSTGYLSALPYVAMYILSFVFSWIAEYLVNNNVTSLSSTRKIFNTIAFWGPAASLLALCYIPAGHLTLAVVMLTITVGLNGAHYVGFLISHIDLSPNFASTLMGITNGCGNIFSIMAPLSVSVVVSDEKSAADWRKVFFISIAFYFLSNLFFILFMSGNVQDWNEPQSKNTIEEGEKKSKKEKTGEHKF-
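
Consistency check (illustrative sketch, not part of the original record): the 1311 bp transcript corresponds to 436 codons plus one stop codon (436 x 3 + 3 = 1311), which matches the 436 aa protein. The Python below is a minimal way to confirm that a coding sequence translates to the listed protein. It assumes the standard genetic code and assumes the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `CODON_TABLE` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is used by default
    because that is how the record above appears to mark the stop.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons and last five codons of DPOGS201205-TA.
print(translate("ATGAGCGTCACTATTCTT"))
print(translate("GAACACAAGTTTTAA"))
```

Passing the full DPOGS201205-TA sequence to `translate` and comparing the result with DPOGS201205-PA would extend the same check to the whole record.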