Monarch geneset OGS2.0

DPOGS215674
Transcript        DPOGS215674-TA  1152 bp
Protein           DPOGS215674-PA  383 aa
Genomic position  DPSCF300041 - 1140125-1145291
RNAseq coverage   20x (Rank: top 80%)
Annotation
Heliconius        HMEL016358        1e-99  49.18%
Bombyx            BGIBMGA001007-TA  4e-27  25.55%
Drosophila        CG8028-PB         2e-37  23.53%
EBI UniRef50      UniRef50_D2A1P8   4e-38  30.23%  Putative uncharacterized protein GLEAN_07071 n=3 Tax=Tribolium castaneum RepID=D2A1P8_TRICA
NCBI RefSeq       XP_971903.2       1e-38  30.30%  PREDICTED: similar to monocarboxylate transporter [Tribolium castaneum]
NCBI nr blastp    gi|270005064      1e-37  30.23%  hypothetical protein TcasGA2_TC007071 [Tribolium castaneum]
NCBI nr blastx    gi|189236816      2e-39  30.39%  PREDICTED: similar to monocarboxylate transporter [Tribolium castaneum]
Group
Gene Ontology     GO:0055085  4.3e-15  transmembrane transport
                  GO:0016021  4.3e-15  integral to membrane
KEGG pathway
InterPro domain   [6-363] IPR016196   1.9e-25  Major facilitator superfamily domain, general substrate transporter
                  [36-364] IPR011701  4.3e-15  Major facilitator superfamily
Orthology group   MCL35066  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215674-TA
ATGAAGAGTAACGCTATGACCACGGAAAAATCAAACAGAAAATTTGAATATGTTGCTCCAGAAGGAGGTTGGGGTTACGTAGTGTGTATCGGACATATTATAAATTCGATCACCATAACCGCTTTTCTCAGTTGCTTCGGCATGATCTACAAAGATTTGTTAATTAAATTGAACATGGATTCTACTTCCATCACGTTGCTAAACGGGATCAGTGCGACTTGTATGTCGTTATCAGGATTCCTCACCAGTCCTATGTTGAAATTTTTGTCCATTAGACAACTGGGCTTGGTAGCGGCTGTAATATTTAATCTAGGGATGCTAGGGATGGTATTTGTATCGTCTAAAATAACATTCTACGTTTGTTTCGGTATATTACAGAGTCTCGGGAACGGAATTATATTCAATTTGTCTTGCACGGTGTTGAACAATTACTTTGTTAAGAAAAGATTGCTTGCAATAAGTTTCACCCAAACTATTATAGCTATATCTGGACTCGTCGTACCCCAGTTTTTAAACTGGTCGTTCGATGCATACGGACAACGAGGAACTCTGCTTTTGGTATCCGGGATTTGTATACACAATATATTTGGCATTATCCTGATGCAACCTATCGCTTGGCATTTGAAACAAGTTGAAGTACCTGAAACAGAAAAGGACACATTTGAAACAAAATCGTTTCTGACGAAGGAGCATCAGATTTCTAATAAGGAAAATTCCAAACAAAAATCAAATGATGCTGGCGAAGTTGTTATTAGTGAGAAATATAATCTTGAAGAAGGGACCATTTATAGCTCACAGTCTAGGAGTATATTATCGAACTTTGTGGATTTTAAAGTATACAGATCATTTCTATTATCAAATGCATACCTTGGTGTGGGACTTTGTACTTTCTCAGAGTTCACGTTTACAATAATGCTTCCTCAGGCTTTATATTCTATGGGATGGGATGAAACAAATGTAGCTTGGGCGTTATCGTTAATGGCTACAGGAGACTGCGCGTCAAGATTTTTATTAATATTTTTGAGCGGTTGGTTGGCAAATTTTGGAAGCCATGAGATTTACATTGTTGGACTCATAATAGGTTTTATTACTAAAATAGGTGAGTTATATATTATTAAGATTTCTTATAGTTATGATGTTCACAATGCCTGA

Protein sequence:

>DPOGS215674-PA
MKSNAMTTEKSNRKFEYVAPEGGWGYVVCIGHIINSITITAFLSCFGMIYKDLLIKLNMDSTSITLLNGISATCMSLSGFLTSPMLKFLSIRQLGLVAAVIFNLGMLGMVFVSSKITFYVCFGILQSLGNGIIFNLSCTVLNNYFVKKRLLAISFTQTIIAISGLVVPQFLNWSFDAYGQRGTLLLVSGICIHNIFGIILMQPIAWHLKQVEVPETEKDTFETKSFLTKEHQISNKENSKQKSNDAGEVVISEKYNLEEGTIYSSQSRSILSNFVDFKVYRSFLLSNAYLGVGLCTFSEFTFTIMLPQALYSMGWDETNVAWALSLMATGDCASRFLLIFLSGWLANFGSHEIYIVGLIIGFITKIGELYIIKISYSYDVHNA-