Monarch geneset OGS2.0

DPOGS214745
Transcript         DPOGS214745-TA    1161 bp
Protein            DPOGS214745-PA    386 aa
Genomic position   DPSCF300022 + 672764-676595
RNAseq coverage    269x (Rank: top 40%)
Annotation
Heliconius        HMEL006005         1e-173   74.02%
Bombyx            BGIBMGA004739-TA   9e-120   73.80%
Drosophila        CG9413-PB          5e-54    39.14%
EBI UniRef50      UniRef50_Q7Q6V2    8e-96    56.79%   AGAP005653-PA n=16 Tax=Endopterygota RepID=Q7Q6V2_ANOGA
NCBI RefSeq       XP_971788.1        4e-98    56.35%   PREDICTED: similar to cationic amino acid transporter [Tribolium castaneum]
NCBI nr blastp    gi|91078280        8e-97    56.35%   PREDICTED: similar to cationic amino acid transporter [Tribolium castaneum]
NCBI nr blastx    gi|158294548       2e-96    56.75%   AGAP005653-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology     GO:0016020   6e-105    membrane
                  GO:0003333   6e-105    amino acid transmembrane transport
                  GO:0015171   6e-105    amino acid transmembrane transporter activity
                  GO:0006810   8.1e-13   transport
                  GO:0055085   8.1e-13   transmembrane transport
KEGG pathway
InterPro domain   [98-386]   IPR002293   6e-105    Amino acid/polyamine transporter I
                  [72-261]   IPR004841   8.1e-13   Amino acid permease domain
Orthology group   MCL16735   Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214745-TA
ATGAAGAATAAGACGAAACAAGCTGAAGAACTCACTGCCTTCACTAGGGGAAACATTTTTGATATAGATAACGAAATACAAAGGTTGAATAACCTAACAAAAATGTCGGAAGAAGCTCTCTCTAGGCCGAGAATTGTGCAAAAGGAAATTATTCCAGAGCATTTAGACAAGGCTAAGAAAACCTTTCACCTACTTAAGACTGAACTGCATGGACTTATACAGTCATTGTTTCCTCATGCTAGCGATCAAATGATGGATGTCATGGGGGTAAGGGTTTCGTTACATGTACAAGTATTTATAATGACGTACATAAACATCACAAGTGTGAAGCTGTTTGTGAAAGTGCAGAATATATTTGGAGTTTGCAAAGTATTTGCTTGTCTTATTGTCATCGGCGGTGGGATATATGAAATAGCTAAAGGCAACACTGAGAACCTCAGTAAAGGATTTGAGGGTAGCACGAACAGCGCGGGCGGCATAGCGCTCGCACTATACTCAGGTTTATGGGCCTATGATGGATGGAACAGCGTCACTGTTGTCACAGAGGAGATCATTAACCCTGGCGTCAACGTTCCTCTCAGCATATCGATCGCCGTGCCCCTTATCACGGGGCTGTATGTGTTCATGAACGTGGCCTACATGACAGTGTTGAGTTACTCCGAGATGATCTCAGTACCAGCAGTAGCGGTCGCCTTCGGTGCTAGAGTATTGGGACCTTTCAGCTTCATTATACCACTCGGTGTTGCCATTGCAACCTTCGGATGCGCCATGAGTGCCTTTCTAACATCCATATTCATATCCGTGGGTAACATAAAGACTCTCATAGACTTCGCTAGTTGGTTTCTCTGGTTCTTCTACGGCCTGGCGATGGTCGCGTTGTTGGTGCTGAGGAAGACGAAGCGGGACACCCCCAGACCCTACCGCGTGCCCACACTAGTCCCCTGCTTCGTATTACTGGTGGCGATATTCTTGTCCATACTACCGATAGTGCACGACCCCTCAGTCAAATATTTAATGGCTATCGGGTTCATGGCTCTGGGCGTTGTTGTATACACCGTGTTCGTGTATTATAAGAAAACGCCAACAAAGTTACTTAATAAATTCACATTCCTGACGCAAGTGCTCTTCGAGAGTGTGCCGCCTAGCAACCATGAAGACTGA

Protein sequence:

>DPOGS214745-PA
MKNKTKQAEELTAFTRGNIFDIDNEIQRLNNLTKMSEEALSRPRIVQKEIIPEHLDKAKKTFHLLKTELHGLIQSLFPHASDQMMDVMGVRVSLHVQVFIMTYINITSVKLFVKVQNIFGVCKVFACLIVIGGGIYEIAKGNTENLSKGFEGSTNSAGGIALALYSGLWAYDGWNSVTVVTEEIINPGVNVPLSISIAVPLITGLYVFMNVAYMTVLSYSEMISVPAVAVAFGARVLGPFSFIIPLGVAIATFGCAMSAFLTSIFISVGNIKTLIDFASWFLWFFYGLAMVALLVLRKTKRDTPRPYRVPTLVPCFVLLVAIFLSILPIVHDPSVKYLMAIGFMALGVVVYTVFVYYKKTPTKLLNKFTFLTQVLFESVPPSNHED-
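
Consistency check (illustrative):

The transcript and protein entries above can be cross-checked against each other. The sketch below is not part of the record: it assumes the standard genetic code applies to this nuclear gene, and the helper names translate and expected_cds_length are hypothetical. It translates a coding sequence codon by codon, writing the stop codon as "-" to match the trailing character of the protein sequence shown here, and computes the coding length implied by a protein length.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard code applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence; stop codons are written as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


def expected_cds_length(protein_length, has_stop=True):
    """Nucleotides needed to encode protein_length residues, plus a stop codon."""
    return 3 * protein_length + (3 if has_stop else 0)


# First eight codons and last three codons of DPOGS214745-TA.
print(translate("ATGAAGAATAAGACGAAACAAGCT"))  # MKNKTKQA
print(translate("GAAGACTGA"))                 # ED-
print(expected_cds_length(386))               # 1161
```

The 386 aa protein plus one stop codon accounts for the full 1161 bp listed for the transcript, and the translated ends agree with the first and last residues of DPOGS214745-PA.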