Monarch geneset OGS2.0

DPOGS206040
TranscriptDPOGS206040-TA1431 bp
ProteinDPOGS206040-PA476 aa
Genomic positionDPSCF300028 - 1341970-1346175
RNAseq coverage743x (Rank: top 17%)
Annotation
HeliconiusHMEL0066303e-13382.78% 
BombyxBGIBMGA000714-TA0.071.10% 
Drosophilapath-PC4e-14057.11% 
EBI UniRef50UniRef50_E2A7X22e-14558.09%Proton-coupled amino acid transporter 4 n=9 Tax=Endopterygota RepID=E2A7X2_CAMFO
NCBI RefSeqXP_001687875.12e-15358.12%AGAP007633-PD [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3227997868e-15858.75%hypothetical protein SINV_05703 [Solenopsis invicta]
NCBI nr blastxgi|3227997863e-15758.46%hypothetical protein SINV_05703 [Solenopsis invicta]
Group
KEGG pathway 
InterPro domain[56-464] IPR0130571.9e-66Amino acid transporter, transmembrane
Orthology groupMCL15535 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206040-TA
ATGCAGGAGTCAAATGGGAACGTGGCTCCTCCCCAGGAGTTGGAGACGTTCCTTCCACAAGACGAGAAGAAAGACAAGGTTGAGAAAAAATATAACCTAACTAAAGAAAAAGATGTCGAAGAGGGTGATTACGATCCGTTTGCAGAAAGAAAATTGGACAATCCGACCTCCAATATGGACACACTGACTCACTTACTGAAGGCGTCTTTGGGTACTGGTATTCTAGCTATGCCAAAAGCCTTTCAGTGTTCAGGGCTTTTGGCGGGAATTTTCTTCACGATTTTGGTCGCTGTAGTATGCACTCACTGCGCATACGTCCTTATAAAATGCGCACACGTACTTTACTACAAGACGAAAAAACCAACAATGAGCTTTCCGGAAGTTGCGGAGGCGGCCCTGGATAACGGTCCCCAATGGGGAAGAAGATGGGCATATACTTTTAGGATCTTCATCTTGGTCAGTCTGTTCATAACGTACTTCGGTACGTGTTCGGTGTACGCGGTTATAATTGCTGAAAATATTAAAAAGGTAGTTCATTTCTATTGGGAAAGCACCCAAGAAAACTTCGGGATACGAATATTTATCCTCCTAATTCTCCCACTGCTAATCTTTATGGCATGGATCAAGAATCTGAAATATTTGGCGCCGGTCTCAATGATAGCAAATTTATTTATGGCGGTGGGCCTCGGGATAACGTTTTATTTCCTCGTCGGCACCGAGTCCTTGGATTTCGGGAAAGTTGCAGCAGTGAAACATCCCAGCGAATGGCCGCAATTTTTCTCCCTCACAATCTTTGCCATGGAAGCAATCGGTGTCGTGATGCCTTTAGAAAATTCGATGAAAACTCCGCGCTCTATGCTTGGATTCTGCGGGGTTCTGAACAAGGGGATGTCTGGTGTGACCTTGGTGTACATTCTTCTTGGATTCCTTGGTTACCTCCGCTACGGAGAGCTGGTACAAGATTCGATCACGCTCAACTTGGAACCGCACCCCGACGATCCTAAGATCTATGAAGTTCTCGCCCAAACCGTAAAAATTTCCATCGCCATCGCCGTGTACTGCACATTTGGGCTCCAATTCTTCGTCTGCATCGAAATCATGTGGAACTGCATGAAGGACAAGTTCACTCAGCGGCCGGACCTCGCGGACTACGTGATGCGCACCATCCTAGTCACAGTGTGCGTTCTCCTGGCCGTGGCCGTGCCCACCATAGGTCCGTTCATGGGCGTCATCGGCGCGTTCTGCTTTTCTATCCTCGGCCTCATCGCTCCCGCTTTCATAGAAATCATAACCTTCTGGGACATCGGTTTCGGTCCTTACAAATATCTCATATGGAAAAATTTACTCGTACTAATCTTCGGCCTGTTCGCTCTCATTTTCGGCACCATAGATGCGTTCAAAAGCATAATATCCGTGTACAGCGCACACTAG

Protein sequence:

>DPOGS206040-PA
MQESNGNVAPPQELETFLPQDEKKDKVEKKYNLTKEKDVEEGDYDPFAERKLDNPTSNMDTLTHLLKASLGTGILAMPKAFQCSGLLAGIFFTILVAVVCTHCAYVLIKCAHVLYYKTKKPTMSFPEVAEAALDNGPQWGRRWAYTFRIFILVSLFITYFGTCSVYAVIIAENIKKVVHFYWESTQENFGIRIFILLILPLLIFMAWIKNLKYLAPVSMIANLFMAVGLGITFYFLVGTESLDFGKVAAVKHPSEWPQFFSLTIFAMEAIGVVMPLENSMKTPRSMLGFCGVLNKGMSGVTLVYILLGFLGYLRYGELVQDSITLNLEPHPDDPKIYEVLAQTVKISIAIAVYCTFGLQFFVCIEIMWNCMKDKFTQRPDLADYVMRTILVTVCVLLAVAVPTIGPFMGVIGAFCFSILGLIAPAFIEIITFWDIGFGPYKYLIWKNLLVLIFGLFALIFGTIDAFKSIISVYSAH-