Monarch geneset OGS2.0

DPOGS203302
TranscriptDPOGS203302-TA1335 bp
ProteinDPOGS203302-PA444 aa
Genomic positionDPSCF300003 - 1123127-1125275
RNAseq coverage24x (Rank: top 77%)
Annotation
HeliconiusHMEL0166530.074.79% 
BombyxBGIBMGA003876-TA6e-10968.91% 
DrosophilaCG1139-PA3e-9241.22% 
EBI UniRef50UniRef50_E1ZXG53e-10243.76%Proton-coupled amino acid transporter 1 n=6 Tax=Endopterygota RepID=E1ZXG5_CAMFO
NCBI RefSeqXP_001861480.12e-10044.71%amino acid transporter [Culex quinquefasciatus]
NCBI nr blastpgi|3454957088e-10545.28%PREDICTED: proton-coupled amino acid transporter 1-like [Nasonia vitripennis]
NCBI nr blastxgi|3071898983e-10643.76%Proton-coupled amino acid transporter 1 [Camponotus floridanus]
Group
KEGG pathway 
InterPro domain[35-434] IPR0130577.1e-60Amino acid transporter, transmembrane
Orthology groupMCL24948 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203302-TA
ATGTTTGTTTACGAGTTTATGGCTGGAATAGTTGAGCATATAACTGGAGGCGAGGAAGAGGAATCTTTCGACCCACATGAACACCGCAGAGTAGAGAGACCGACGACTTATTCAGACACGATGACGCATCTGCTGAAGGGTAGTATAGGAGCTGGCATCCTCGCCATGGCGGACGCGGTCGCCCGTGTCGGAATTGTTTTCAGTATTTTTGGCATTTTAATGATTGGATCCTTTGCTACTTACTGCATACAACTTTTAATAGCAACTCAGTATAAATTGTGTAAGAGATTCAAACGCGGTTATCTGGCCTATCCTAAATCCATGCTTTTTGCTATCCAAGAAGGACCCCCGTGCCTCAGGTGGTCCGCCAGATCACTTTATTACTTTGTTGATTCTGTGTTGATCCTTTGGCAACTCGGTATCTGCTGTATATACTGTGTCTTTGTCGCTGAAAACATAAAGCAGGTTTGTGATTTCCACGGACAAGTAATGTCTTTGAGAACACACCTTTTTTTTCTGTTATTGCCGCTCACGCTCATGGGACTCGTGAAAAACCTCAAACTGTTGACTCCATTTTCTTCTATATCAAACATAGTTACTATATTTGGGTTTGTTCTTGTCTTCTTTTATTTAATTGAAGATGATGTTACTATAGAAGATGAAAAGTTACAATTGAAGGGACTCGAAGAGATTCCATTCTTTATTGGCACGACATTGTTTGCCCTCGAAGCTGTGGGTGTGGTCCTGGCTTTGGAATACAACATGGAGCAACCAAAACGTTTTGTAGGACTCTTTGGTCTTTTCAACATTGGCATGGTTATCATTATGTCACTCTATTTGCTGATGGGGATTTTTGGTTATCTCAAGTATGGAGACGAGATAAAAGCGTCTATAACCTTGAATCTACCCCAAAATCAAAAGAAAGCGCAAGCAGCAAAAGTGATATTTGCAATGGCAATATTTTTGACATTTCCACTGCAGAATTTTGTTGCCTATAGCATTATCTATCGTAAGATACACAAAAAAGTATCAGGAACGAAACTTTTAATTTTAGATTACTTACTACGTGTAGCACTCGTAGTTCTTCCTTGGTTGGCTGCAGTAGCTGTGCCGAAACTGGGACCATTCATAGCTTTGTTCGGTGCTTTCTGTCTGTCCCTTCTATCTATGGTGTTTCCCGGCATCATGGACGCCTGCGTCTGGTACACTGATAGTTACGGCCTGTGCCGCTACCGACTCATTCGTGATATATTCATCGTGTTAATCGGCCTCGCATTTCTCATCTCTGGTTGCTACACAAGCTTACTAGAAATTGCTGCTTCTTCCGACCACTGA

Protein sequence:

>DPOGS203302-PA
MFVYEFMAGIVEHITGGEEEESFDPHEHRRVERPTTYSDTMTHLLKGSIGAGILAMADAVARVGIVFSIFGILMIGSFATYCIQLLIATQYKLCKRFKRGYLAYPKSMLFAIQEGPPCLRWSARSLYYFVDSVLILWQLGICCIYCVFVAENIKQVCDFHGQVMSLRTHLFFLLLPLTLMGLVKNLKLLTPFSSISNIVTIFGFVLVFFYLIEDDVTIEDEKLQLKGLEEIPFFIGTTLFALEAVGVVLALEYNMEQPKRFVGLFGLFNIGMVIIMSLYLLMGIFGYLKYGDEIKASITLNLPQNQKKAQAAKVIFAMAIFLTFPLQNFVAYSIIYRKIHKKVSGTKLLILDYLLRVALVVLPWLAAVAVPKLGPFIALFGAFCLSLLSMVFPGIMDACVWYTDSYGLCRYRLIRDIFIVLIGLAFLISGCYTSLLEIAASSDH-