Monarch geneset OGS2.0

DPOGS209942
TranscriptDPOGS209942-TA2061 bp
ProteinDPOGS209942-PA686 aa
Genomic positionDPSCF300148 - 400955-404893
RNAseq coverage149x (Rank: top 53%)
Annotation
HeliconiusHMEL0099950.090.62% 
BombyxBGIBMGA011261-TA0.082.64% 
DrosophilaCG12858-PA0.056.34% 
EBI UniRef50UniRef50_D6WA600.068.01%Putative uncharacterized protein n=2 Tax=Endopterygota RepID=D6WA60_TRICA
NCBI RefSeqXP_001950245.10.063.66%PREDICTED: similar to AGAP003204-PA [Acyrthosiphon pisum]
NCBI nr blastpgi|2700016090.068.01%hypothetical protein TcasGA2_TC000461 [Tribolium castaneum]
NCBI nr blastxgi|2700016090.067.80%hypothetical protein TcasGA2_TC000461 [Tribolium castaneum]
Group
Gene OntologyGO:00550852.1e-13transmembrane transport
GO:00160212.1e-13integral to membrane
KEGG pathway 
InterPro domain[1-583] IPR0161964.9e-48Major facilitator superfamily domain, general substrate transporter
[407-575] IPR0117012.1e-13Major facilitator superfamily
Orthology groupMCL13025 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209942-TA
ATGCAGCACCCGCAGCAGCCGCCGCTGGCCGCGCGGCCCCTCGTCAACCCCGACGAGACTGGAGAAGTCGACACCTCCAGATACCCCGAACCTAAAGAGGCTACTCACAAAGTCCGAGGGCGAAGCGATGTCCTGGAGCTCATCTGTGGGCCGGGGACCGTCGACCCCGAGCTCCTCACCGTCAAGACTTTCTACTTCTTCTTCTACTCTGCTTTCGGATCACTATTCCCTTTAATGGGAGTTTACTTCAAACAGATGGGAATGAACGCTGGCCAGTGTGGGCTACTTATCGGTACTAGACCTTTCGTAGAATTTTTATCGGCACCATTTTGGGGTGGACTGGCAGATAGATGGCAGAAGGGAAGAATATTATTACTAGCGTCTCTAACCGCCTGGATAGTGTTCACACTGCCGCTGAGCTGGGTCCAGCCGACAGCCGTGTCTTGCGTACAACCTGTCAACAGCACCGTTTACCGTCTGGTCTCGCCGCGGTACGACGAGGACTGGCCCACGCCCACTAGACACTTCCGCGGTCCGGCTCTGGGTCGCGAGGGATCGCCGCTTCCTGTGACGGATGCAGAGAACTACAATCCGGATACCAACTACAACTGGGTGACGCCACTGCACTCCTACATCGTATACAGTACCCCGGACATACAAAAGACATTTTTCTTATTGTTGCTGCTCGTTGTAATCGGAGAGTTCTTCAGCGCGCCCGCTATTACTTTAGCGGATTCTGCGGTTATAACATTACTCGGCGAAGATGCTGACAGATACGGTCACCAGCGCATGTTCGGTTCCTTGGGCTGGGGCTTAGCCATGTTCTTCGTGGGTATCGCGCTGGACCACAGCACTGCCTTCAGCTCTCACCCTTGTGGCGGTCCTCAGCGCTACGAGAAGAATTACACGATCTGCTTCGCTACATTCTCGGTCCTGATGGGTGCCGCACTAATTACTGCCACCCAGATTAATTTTAAATACGAGGAAATTAACGTTGAAACCCCCTTGGAGCCACCTCCGCCCGCGGAACCTTCTCACGAGGAACGCATGCAGCAGCAACTGGCGGAACAGCTGCAACTTCCTGGACTGGACACCAGCGCGCCGGCGCCCCGGCAGCCGCCTCTCGAACACGCTAAGGTGTTCGCTCAGACCACTCGCGAGATGCCGGAGTGGGTGACGGTACTACGGCAATTTCAGAACGTGAAAGCTGCGTCCTTCCTGTTAGTCGCCTGGTTCATGGGCTTCGGGATTGGACTGATCTTCACATTCCTTTTCTGGCACTTACAGGATATCGGCGGCTCACCGACGCTTTTTGGTGTCGCTTCCGTCATCAACCACATCTCCGAGATCTTCGCCTACTTCTTCAGTTTCAAGCTTATCACTCAAATGGGACATGTTAAAGTATTATGCTTGGGTCTCGCCGGGAACGTGGTGCGCTTTCTATACATCTCTTGGCTGACGCGACCCTGGTGGGTGCTTCCTTTCGAGTTTGTCCAGGGTGTCACCCACGCCGCCGTGTGGGCGGCCTGCTGCTCCTATATAGCTCACGGCTCGCCACCCAACCTTCGTTCATCCGCACAAGGAGTGCTCCAGGGCCTGCACCACGGCCTGGGGCGAGGTTGCGGCGCGGTGCTAGGAGGCATCGCGGTAGCCAAATGGGGAACGACTCGCACCTTCGCCGGCTACGGTCTGTTGTGTGGTGTGGCGCTCGCAGCATTCGCCTTTGTGAACTTCCGCGATGGCGGCATGGGCCCGACGATTCCTGGCGACTCCGCGGCGGACGAAGAAGCTCGTGCAGTGGCGGAGGCGGGCGTGCTAGCTCCTCACGGTGTCCCTTCTAATCCGCTCCCTCGAGCCCTATCGTCCACTCGCCTGGCAGACTTGGCCAACCACGACAACTACGGCGCCACACAGAGTTACGCCGGAGCGGACAGCCTCGGCGTGCCGGGAGCCGCGCCGCCCCCGCAGCCCGGTCCCGGCCCCGGCCCCGGCCCGTCCGGCCCCGGCCCGCGACCGGCCAACCCCCTTCTGGCCGAGTCCGCCGGGGCCGCTTACCGATAG

Protein sequence:

>DPOGS209942-PA
MQHPQQPPLAARPLVNPDETGEVDTSRYPEPKEATHKVRGRSDVLELICGPGTVDPELLTVKTFYFFFYSAFGSLFPLMGVYFKQMGMNAGQCGLLIGTRPFVEFLSAPFWGGLADRWQKGRILLLASLTAWIVFTLPLSWVQPTAVSCVQPVNSTVYRLVSPRYDEDWPTPTRHFRGPALGREGSPLPVTDAENYNPDTNYNWVTPLHSYIVYSTPDIQKTFFLLLLLVVIGEFFSAPAITLADSAVITLLGEDADRYGHQRMFGSLGWGLAMFFVGIALDHSTAFSSHPCGGPQRYEKNYTICFATFSVLMGAALITATQINFKYEEINVETPLEPPPPAEPSHEERMQQQLAEQLQLPGLDTSAPAPRQPPLEHAKVFAQTTREMPEWVTVLRQFQNVKAASFLLVAWFMGFGIGLIFTFLFWHLQDIGGSPTLFGVASVINHISEIFAYFFSFKLITQMGHVKVLCLGLAGNVVRFLYISWLTRPWWVLPFEFVQGVTHAAVWAACCSYIAHGSPPNLRSSAQGVLQGLHHGLGRGCGAVLGGIAVAKWGTTRTFAGYGLLCGVALAAFAFVNFRDGGMGPTIPGDSAADEEARAVAEAGVLAPHGVPSNPLPRALSSTRLADLANHDNYGATQSYAGADSLGVPGAAPPPQPGPGPGPGPSGPGPRPANPLLAESAGAAYR-