Monarch geneset OGS2.0

DPOGS203005
TranscriptDPOGS203005-TA1368 bp
ProteinDPOGS203005-PA455 aa
Genomic positionDPSCF300068 + 75133-83844
RNAseq coverage4185x (Rank: top 3%)
Annotation
HeliconiusHMEL0110170.081.32% 
BombyxBGIBMGA003864-TA1e-16172.36% 
DrosophilaCG13646-PA1e-2026.04% 
EBI UniRef50UniRef50_G1UH291e-17767.99%Similar to amino acid transporter n=4 Tax=Bombyx mori RepID=G1UH29_BOMMO
NCBI RefSeqXP_002422893.15e-9144.34%vacuolar amino acid transporter, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3796989384e-17767.99%os protein [Bombyx mori]
NCBI nr blastxgi|3796989380.069.03%os protein [Bombyx mori]
Group
KEGG pathway 
InterPro domain[48-414] IPR0130576.5e-61Amino acid transporter, transmembrane
Orthology groupMCL22052 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203005-TA
ATGTCTAAGAAAAATGGTTTGGAGGTCTCGTTGTCGGCCGGCAAGTCGCCTGCGACGGAGTCTACACCCCTCGTTCCTAAAGTGGGTATCGAAGAAGGTGGCAGCGGTGGAGAATCCAGCAAGAATGGCATCAGCGGAGGTCTGTCGATGAACCAAACAGCTTTTCTGATCGCTGGTGAGCTGGTTGGCAGCGGTGTCCTGGCTCTTCCGAAGGCTGTCGTCAAAACCGGATGGGTTGGCATTCCCTTGATAGTGCTGATGTGCCTCCTGGCTGCCTTCAGCGGCAGGAGGTTGGGAGACTGCTGGACCATCATCGAGAGCCGAGACCCTGAAATGAGAACCAGGAAAAGAAACCCTTACGCCATAATAGCTGAACAGTCTCTTGGAAAATTCTGGAGCGTGGGTGTATCTCTCGCCATGATAGTGACGCAGTTTGGCGTGGCGGTAGTTTATCTACTGCTGGCTGCTCAGATCATTGAGCAAGTGTTCCTCTCCCTCATGCCCACCGTAACGATCTGCATCTGGTACCTAGTGGTGGTGGGGGCTATGACCCCACTCACTCTTTTCGGCACGCCCAAAGATTTCTCCTTCTTGGGAGTGATTGCTTTCTTCGCGGCGGTGGTAGCATGTGTCCTGTACTTCATACAAATGATGAACGACATCAGACCTTACCCCGTATTCCGTTGGGGCATCCACGGCTTCACGGACTTCTTCCTCGCCTTCGGCACCATCATGTTCGCTTTCGGTGGAGCGTCCACATTCCCGACTCTTCAGAACGACATGGCCGACAAGACCAAGTTCAACAAGAGCCTGCAGTACGGATTCATTGCAATCTTGGCCATGTATTTGCCCATCGCGATCGCGGGCTATGCGATCTACGGTGAGTCTGTGGGACCAAACTTCGCTACATCACTGTCCGCGACCCCCCTGTCTCTGGTCGGCAATGTCATGATGGCTATCCACCTGGTCTGTGCCTTCGTCATCCTCATCAACCCCGTCTGCCAGGAGATGGAGGAGCTCTACAACATCAACAGTGACGCCATCGGCTACCGTACGCTCGTACGTTTCTCCATCATGGCTGGTATACTGTTCATCGGGGAGAGCATCCCTCGCTTCTACACCATCCTAGCGTTTGTGGGGGCTACCACCATCGCTCTACTCACCTACGTGCTACCCTCCTACTGCTACCTGAACCTTGTCAATCAGCCACCAAGGGAAGGACAGGCGCCACTTGAGGTAGCGGGATGGGTTAAGCTGGTCTGTTGGGAAGTGTTGGTCATTGGTATCTTGGGAGGCGCGGCGGCTACCTACAGCGCTCTTAGTGCCATCTTTGGCACTGCTCAGGCCGTTCCGTGCTACCTCAAGTAG

Protein sequence:

>DPOGS203005-PA
MSKKNGLEVSLSAGKSPATESTPLVPKVGIEEGGSGGESSKNGISGGLSMNQTAFLIAGELVGSGVLALPKAVVKTGWVGIPLIVLMCLLAAFSGRRLGDCWTIIESRDPEMRTRKRNPYAIIAEQSLGKFWSVGVSLAMIVTQFGVAVVYLLLAAQIIEQVFLSLMPTVTICIWYLVVVGAMTPLTLFGTPKDFSFLGVIAFFAAVVACVLYFIQMMNDIRPYPVFRWGIHGFTDFFLAFGTIMFAFGGASTFPTLQNDMADKTKFNKSLQYGFIAILAMYLPIAIAGYAIYGESVGPNFATSLSATPLSLVGNVMMAIHLVCAFVILINPVCQEMEELYNINSDAIGYRTLVRFSIMAGILFIGESIPRFYTILAFVGATTIALLTYVLPSYCYLNLVNQPPREGQAPLEVAGWVKLVCWEVLVIGILGGAAATYSALSAIFGTAQAVPCYLK-