Monarch geneset OGS2.0

DPOGS203336
TranscriptDPOGS203336-TA1404 bp
ProteinDPOGS203336-PA467 aa
Genomic positionDPSCF300003 - 270494-274447
RNAseq coverage152x (Rank: top 53%)
Annotation
HeliconiusHMEL0179380.084.15% 
BombyxBGIBMGA003856-TA0.081.80% 
Drosophilagb-PA2e-16260.13% 
EBI UniRef50UniRef50_Q9VB753e-16060.13%GH08870p n=43 Tax=Pancrustacea RepID=Q9VB75_DROME
NCBI RefSeqXP_001989720.13e-16460.98%GH18946 [Drosophila grimshawi]
NCBI nr blastpgi|1950365246e-16360.98%GH18946 [Drosophila grimshawi]
NCBI nr blastxgi|1571296877e-16562.31%amino acids transporter [Aedes aegypti]
Group
Gene OntologyGO:00160201.2e-235membrane
GO:00033331.2e-235amino acid transmembrane transport
GO:00151711.2e-235amino acid transmembrane transporter activity
GO:00068105.6e-27transport
GO:00550855.6e-27transmembrane transport
KEGG pathway 
InterPro domain[4-467] IPR0022931.2e-235Amino acid/polyamine transporter I
[20-382] IPR0048415.6e-27Amino acid permease domain
Orthology groupMCL16347 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203336-TA
ATGAGTGAAGAACGGGTTAAAATGCGCAAACAACTAGGGTTGCTGGAAGGGGTTGCAATTATACTTGGTATTATATTCGGCAGTGGTATTTTTATCTCCCCCAAAGAAGTTTTAGAAAAAACTGGGTCGGTTTGGGGAGCTCTATCAGTTTGGGCGGTCTGCGGAGTTTTGGCGACACTTGGTGCCATGTCATATGCAGAATTAGGGACAGCATTAGCTAAGAGCGGCGGTGATTATCACTATATAAACGAGGCCTATGGCTCCTTACCAGCTTTCCTGTACCTCTGGGATGCAAATTTGGTTTTCGTACCGAGTACAAATGCAATAATGGCGCTAACATTTGCTAATAATCTACTCGAACCGATATTTCCAAATTGTACTATAGACCCTTTGAGTACGAAGCTTATAGCTGCAGTAACTATATGTTTCCTAACATTCATCAATGCATACGATGTGAGATTCACAACCAGAATACAAAATGTCTTTATGTTCACAAAGATCTCTGCTTTGGTTGTCATTATTGTTGGTGGAATTGTCTGGATGGGAAGAGGTGGTGTCGAAAACTTTGATGATGGCTGGGCAGGTACTAAGACGTCCATCAGTGATTGGTCTGTAGCTTTCTACTCCGGAATCTTTTCATATTCCGGCTGGAATTACCTGAACTTTATGACAGAAGAACTCCGAGATCCCTATGTGAATCTTCCTCGTGCCATTTATTTGTCCCTGCCCCTGGTAACTGCTATATACATACTAGCGAATGTCTCATACATGGCCGTGCTCGGACCATCTGGTGTCAGAGCTACCAAAGCTATAGCAGTTGACTTTGCAGGATCAGCTCTTGGGTCTATGAAGTGGGCGATGCCGACCTTAGTGGCTATTGCTATACTCGGAGGCCTGTCAGTTCATATCATGACGTCATCGAGGATGTGCTTCGCTGGAGCCCGTAATGGTCACATGCCAGCCCTGTTAGCTCACATTAATGTTAAATGTATGTCACCGATGCCGTCGCTCGTGTTCTTGATGCTGATCTCTCTGCTCATGTTGATCCCAAGCAATCTGACGTCTCTAATAACGTACTGCACAGTGGTCGAGTCGTTTTTCACAACGCTAAGTTGTAGCGCTGTGCTGTGGCTGAGATACAAGAGACCAGACATAGTGAGACCTATCAAGGTGTCTCTGTGGATGCCGGTGGTGTTTGTCACAATATGCACAGTGTTGCTAGTTGTGCCAATAGTTAGTGAACCGGTGGCAGTGTTGGCCGGTGCGTTTATAACTCTAGCTGGGGTTCCCGTGTATTTTTTGCTGGTGAGGAGTAAGCCTGAGCCAGTAGTTCAACTATCGAACAAATTTACACTTCTGTGTCAGAAGTTGTTCCTATCCAGCGTCGAAGATAAGGAAGATTAA

Protein sequence:

>DPOGS203336-PA
MSEERVKMRKQLGLLEGVAIILGIIFGSGIFISPKEVLEKTGSVWGALSVWAVCGVLATLGAMSYAELGTALAKSGGDYHYINEAYGSLPAFLYLWDANLVFVPSTNAIMALTFANNLLEPIFPNCTIDPLSTKLIAAVTICFLTFINAYDVRFTTRIQNVFMFTKISALVVIIVGGIVWMGRGGVENFDDGWAGTKTSISDWSVAFYSGIFSYSGWNYLNFMTEELRDPYVNLPRAIYLSLPLVTAIYILANVSYMAVLGPSGVRATKAIAVDFAGSALGSMKWAMPTLVAIAILGGLSVHIMTSSRMCFAGARNGHMPALLAHINVKCMSPMPSLVFLMLISLLMLIPSNLTSLITYCTVVESFFTTLSCSAVLWLRYKRPDIVRPIKVSLWMPVVFVTICTVLLVVPIVSEPVAVLAGAFITLAGVPVYFLLVRSKPEPVVQLSNKFTLLCQKLFLSSVEDKED-