Monarch geneset OGS2.0

DPOGS211114
TranscriptDPOGS211114-TA1440 bp
ProteinDPOGS211114-PA479 aa
Genomic positionDPSCF300007 - 629619-645095
RNAseq coverage678x (Rank: top 19%)
Annotation
HeliconiusHMEL0124330.075.20% 
BombyxBGIBMGA003178-TA2e-12847.79% 
DrosophilaJhI-21-PC0.064.26% 
EBI UniRef50UniRef50_Q9VKC20.064.26%Amino acid transporter protein JHI-21 n=19 Tax=Pancrustacea RepID=Q9VKC2_DROME
NCBI RefSeqXP_002020872.10.065.47%GL14136 [Drosophila persimilis]
NCBI nr blastpgi|1259867800.065.47%GA11552 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastxgi|1953866280.065.19%GJ24006 [Drosophila virilis]
Group
Gene OntologyGO:00160208.8e-255membrane
GO:00033338.8e-255amino acid transmembrane transport
GO:00151718.8e-255amino acid transmembrane transporter activity
GO:00068101.5e-32transport
GO:00550851.5e-32transmembrane transport
KEGG pathway 
InterPro domain[2-468] IPR0022938.8e-255Amino acid/polyamine transporter I
[26-368] IPR0048411.5e-32Amino acid permease domain
Orthology groupMCL18443 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211114-TA
ATGGGTGAAGAAACTCAAGCCGGCGACGGAAAAGTAACATTAAAACGTAAAATAACATTGTTCAATGGTGTCGGAATTATTATTGGGACGATTATTGGCTCTGGAATTTTTATATCGCCGACCGGTGTCTTCCGGTATACTCAGTCAGTTGGAGCCTCGCTTCTTATATGGCTGATATGTGGCCTGCTCTCGACATTAGGAGCCCTATGTTACGCAGAGTTGGGAACTTCGATATCAAGATCTGGGGGCGATTATGCCTACATCTTCACAGCGTTTGGGCCGCTGCCTGCGTTCTTGAGAATGTGGATAGCGCTCCTCATCATAAGACCGACCACGCAAGCGATAGTAGCCATAACCTTCGGCCAATACGTCGTTAAACCTTTCTTCCCCGACTGCGAGCCTCCAGAAAACGCTGTGAAACTCCTAGCAGCGGTGTGTCTCTGTATACTGACAGCCATAAACTGCATCAGCGTCAGATGGACAATGCGTATTCAAGACGTGTTCACTTCATCCAAACTACTGGCGCTGGTCGTTATCATAATATCCGGGATCTATTACATAGCGAGCGGACATACAGAAAATTTTGAAAGGGCATTTGACGGTGAGTACAGCGCTGGTGACATTGCCTTGGCGTTCTACTCTGGTCTGTTCGCTTTCGGTGGTTGGAATTATCTCAACTTCGTCACAGAGGAACTCCAAGATCCTTATAAGAATCTCCCGCGAGCTATATGGATAGCCATGCCAATGGTGACCACTATTTACGTTATGGCCAACTTGGCATACTTCGCCGTCGTCACCAAGACACAATGGCTGGACCCGAAAGCTGTTGTTGCAGCGATCTTCGGGGACCAGCTGTTCGGTAGCTGGAGCTGGTTGATTCCCGTGTTCGTGGCGTTATCAACATTCGGTGGTGTGAACGGGGTGCTGTTCACGTCAGCGAGACTGTTCGCCACCGGGGCCCAGGAGGGTCACATGCCTGGCTTCTTCACGCTGTTCCACGTCGAGAAACAAACTCCCATACCATCACTTATACTGACATGTTTCTTCTCACTTCTCATGTTGACCACCAGCAACGTCATCGAGCTGATAAACTATTACTCACAAACCCTGTGGCTGTCCGTGGGCGCGTCTGTCGTGGGCATGTTGTGGCTTCGGCGGACCAAACCTGAAATGTCCAGACCGATCAGGGTCAACATCGTCATACCATACCTGTTCCTCGTAGCTATCGGCTGTTTAGTCATAATTCCCGCCATAACACAACCTAAAGACACTGCCATCGGTATTGCCATACTCCTGTCGGGGATACCAGTTTATTATCTATGCGTCAAATGGCAGAATAAACCTCAATGCTACAATACGGCTTCCGGCTGCATACTGAGATTCCTTCAGAAGTTATGTTCCTGCGCTTATGTGGATTCATCAGAAAAAATAGCGAATTGA

Protein sequence:

>DPOGS211114-PA
MGEETQAGDGKVTLKRKITLFNGVGIIIGTIIGSGIFISPTGVFRYTQSVGASLLIWLICGLLSTLGALCYAELGTSISRSGGDYAYIFTAFGPLPAFLRMWIALLIIRPTTQAIVAITFGQYVVKPFFPDCEPPENAVKLLAAVCLCILTAINCISVRWTMRIQDVFTSSKLLALVVIIISGIYYIASGHTENFERAFDGEYSAGDIALAFYSGLFAFGGWNYLNFVTEELQDPYKNLPRAIWIAMPMVTTIYVMANLAYFAVVTKTQWLDPKAVVAAIFGDQLFGSWSWLIPVFVALSTFGGVNGVLFTSARLFATGAQEGHMPGFFTLFHVEKQTPIPSLILTCFFSLLMLTTSNVIELINYYSQTLWLSVGASVVGMLWLRRTKPEMSRPIRVNIVIPYLFLVAIGCLVIIPAITQPKDTAIGIAILLSGIPVYYLCVKWQNKPQCYNTASGCILRFLQKLCSCAYVDSSEKIAN-