Monarch geneset OGS2.0

DPOGS202647
TranscriptDPOGS202647-TA1365 bp
ProteinDPOGS202647-PA454 aa
Genomic positionDPSCF300039 - 507924-520116
RNAseq coverage86x (Rank: top 63%)
Annotation
HeliconiusHMEL0080694e-14252.74% 
BombyxBGIBMGA000844-TA1e-13952.40% 
DrosophilaCG8785-PB7e-11044.37% 
EBI UniRef50UniRef50_Q7K2W31e-10744.37%CG8785, isoform A n=18 Tax=Endopterygota RepID=Q7K2W3_DROME
NCBI RefSeqXP_001861307.18e-12149.20%amino acid transporter [Culex quinquefasciatus]
NCBI nr blastpgi|1700504282e-11949.20%amino acid transporter [Culex quinquefasciatus]
NCBI nr blastxgi|3838568204e-11848.10%PREDICTED: proton-coupled amino acid transporter 4-like [Megachile rotundata]
Group
KEGG pathwaytgu:1002324132e-10 
 K08653 (MBTPS1)maps-> Protein processing in endoplasmic reticulum
InterPro domain[43-446] IPR0130572.5e-71Amino acid transporter, transmembrane
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202647-TA
ATGACGCAGAATACAAGTATGGATGACAAAAACAATGTGTTCAGGATGGAATCTACGACTACCCTTCGGTCAGAGAGTGTTGACCTTAATGAAAAATACAACCCTTTTGAAAACAGGAATGTGCCCCACACGACGTCGACACTGGGTTCATTCTTCCATTTGCTCAAATCGGCGTTAGGAACAGGTCTGCTAGCTATGCCAGCCGCATTTAAAAACAGTGGCCTTATCCCAGGAAGCATCGGAATAGTATTAGTGGCAGTTATCGCTACCCATTGTGTTCATATATTAGTAAAAACCTCTCGCGACATCTGCGAAGAATGTCGTCTGGGATCATTAAGTTACACAGATACATGTGTCAAAGTATTTAAACACGGACCTAATAGACTAAGGTCTTACACTGGATTCGTAAGAAACTTTGTTGACTACGCTATGGCTGGAGTTTGTCTCGGCGGGACCAGTGTTTATGTCATATTCATCGCGTCTTCCTTAAAAAATATATTGGACCACTTCTATCCGGAACATAAGTATTCAGTGGAACTGTATTGTGCCATATTACTTTTGCCTCTTGTCGTTCTTACTCAAGTGAGACATCTAAAATTTCTTGTTCCATTCTCCATATTTGCAAATGTATGCCTCCTTTTGACATTCATAGCGACTTGTTATTACACCTTTATGGATTTATCAAAGGCGCCTGATGTCAATCTTATCTCTAGTGTAGAGCAATGGCCTCTATTTCTGAGTACAGCTATATTTTCGATGGAAGGAATCAACGTGGTTATGCCAGTAGAGAATGAGATGAGTAATCCGGAACATTTTCTGGGCTGTCCTGGAGTGTTGAATGCCACTATGTTGGTAGTCGTCATCCTGTATGCTGTCGTGGGATTTTTTGGATACTTGAAATATGGGGAAAGTGTACTTGGAAGCATAACATTGAACTTACCAGAAGATGAAATACTGGCATTGGCAGCCAAAATTCTTGTTGCTGTGGCTGTATTCTTTACATATTTCCTCCAAATGTACGCTCCAATGGACATTTTATGGTTGCGTATGAAGGAAAGAATCAGTCAAAAATATCATAACCTTGGACAGATAATACTACGAACTGTGAGTGTAACGATAACAGTCGTCTTAGCAGTTGCCGTACCCGACCTCGAACTTTTAATCGGTCTCGTCGGAGCCATATTCTTTTCAACATTAGGTCTACTAATTCCAATTGTGGTTCAAACCGTTCACAAATGGGAGAGAGGCTTGGGGAAATATTCTTATATTTTATGGAAAAACGCTCTTTTATTAATAGTGTACATAGTTGTAATAGTATCTGGTTGTTATTCAAGTATCACAAAAATTATTGAAAAATTTATATAA

Protein sequence:

>DPOGS202647-PA
MTQNTSMDDKNNVFRMESTTTLRSESVDLNEKYNPFENRNVPHTTSTLGSFFHLLKSALGTGLLAMPAAFKNSGLIPGSIGIVLVAVIATHCVHILVKTSRDICEECRLGSLSYTDTCVKVFKHGPNRLRSYTGFVRNFVDYAMAGVCLGGTSVYVIFIASSLKNILDHFYPEHKYSVELYCAILLLPLVVLTQVRHLKFLVPFSIFANVCLLLTFIATCYYTFMDLSKAPDVNLISSVEQWPLFLSTAIFSMEGINVVMPVENEMSNPEHFLGCPGVLNATMLVVVILYAVVGFFGYLKYGESVLGSITLNLPEDEILALAAKILVAVAVFFTYFLQMYAPMDILWLRMKERISQKYHNLGQIILRTVSVTITVVLAVAVPDLELLIGLVGAIFFSTLGLLIPIVVQTVHKWERGLGKYSYILWKNALLLIVYIVVIVSGCYSSITKIIEKFI-