Monarch geneset OGS2.0

DPOGS204403
TranscriptDPOGS204403-TA1668 bp
ProteinDPOGS204403-PA555 aa
Genomic positionDPSCF300002 - 1033507-1047069
RNAseq coverage494x (Rank: top 25%)
Annotation
HeliconiusHMEL0156810.097.57% 
BombyxBGIBMGA003178-TA1e-9842.07% 
DrosophilaCG9413-PB0.070.31% 
EBI UniRef50UniRef50_P822514e-14455.14%B(0,+)-type amino acid transporter 1 n=30 Tax=Chordata RepID=BAT1_HUMAN
NCBI RefSeqXP_972107.20.074.72%PREDICTED: similar to AGAP001870-PA [Tribolium castaneum]
NCBI nr blastpgi|2700017100.074.16%hypothetical protein TcasGA2_TC000583 [Tribolium castaneum]
NCBI nr blastxgi|2700017100.075.86%hypothetical protein TcasGA2_TC000583 [Tribolium castaneum]
Group
Gene OntologyGO:00160201.8e-288membrane
GO:00033331.8e-288amino acid transmembrane transport
GO:00151711.8e-288amino acid transmembrane transporter activity
GO:00068101.6e-23transport
GO:00550851.6e-23transmembrane transport
KEGG pathway 
InterPro domain[79-550] IPR0022931.8e-288Amino acid/polyamine transporter I
[107-490] IPR0048411.6e-23Amino acid permease domain
Orthology groupMCL14910 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204403-TA
ATGAGCGATGCGATGCGCGAAAAGCTCGCGAGACTACTTCGCGGCTCTATCGCTGCGCAGACGCATACAGTCACCACGGACAAATTTCAAAACGGCACCACTCATCCTGTACTGCTTTATTCTGATGAAGGGAGCGCACGCGAAGGCGGATTGGTATGGCGCGGGTGTTCCGGGGACTGCGACGCGGAAGATGGCTCGGCGGGAACGCTCGCCGATGGCAACGCCAACCCAGGAGACAAGTTGGAGGGGTCGGACGCAGCTCCGGACGACCCTGTTCACCTCAAGAGACGGGTGGGACTCTTCAGTGGGGTGGCTCTAATCGTCGGTACTATGATCGGTTCTGGAATATTCGTGTCGCCCTCTGGGCTTTTGGAGCGTACAGGCTCAGTGGGCATAAGCTTCATTATATGGATGGCGTGTGGGCTGCTCTCGCTGCTTGGAGCGCTCGCGTACGCGGAGTTAGGAACGATGAACACCTCCTCTGGAGCTGAGTACGCATACTTCATGGACGCTTTCGGTGGACCACCGGCGTTCCTCTTCTCTTGGGTGTCAACGCTGGTGCTGAAGCCGTCTCAGATGGCCATAATCTGTTTGAGCTTCGCAAAATATGCGGTGGAGCCCTTCGTGTCAGAGTGCGAACCGCCTGACGCTCTCGTCAAACTTGTGGCACTTATATCTATTGTGATGATTCTTGCCGTCAACTGCTACAGTGTCAATCTAGCGACGAACGTCCAAAATATTTTTACGGCTGCGAAACTGGTCGCCATCGCGATAATCGTCTGTGGAGGAGCTTACAAACTCATTTTAGGTAATACGCGACATTTACAGGAGCCCAACTTCGCAAGTAGCACCGCGACGCTTGGCAACATCGCCACGGCCTTCTACACCGGGCTGTGGGCCTACGATGGATGGAATAACCTTAATTATGTTACAGAGGAAATTAAAAATCCTTCCAAGAACCTGCCTCTGAGCATAATAATTGGCATTCCGCTGGTGACGCTGTGCTACGCCTTGGTGAACGTGTCGTACTTGGCGGTGATGTCCGTGAGTGAGATGGCGGACAGTGAAGCGGTCGCTGTGACCTTCGGGAACAGACTGTTGGGCCCCATGGCGTGGCTCATGCCGCTGGCTGTTACTATATCAACTTTCGGCTCGGCGAATGGGACTCTATTTGTTGCGGGAAGGTTGTGCTTCGCAGCATCTCGGGAGGGACATTTATTGGATATACTTTCGTATGTCCATGTACGTCGGTTTACACCCGCCCCGGGACTTATATTCCATTCTCTGATAGCGGTGGCGATGGTGCTGTACGGAACCATAGATTCGTTAATTGATTTCTTCTCGTTCACTGCCTGGATATTCTACGGTGGGGCCATGCTGGCGTTGATTGTGATGAGATACACCAAGCCTCACGCGCCTAGACCATACAAGGTGCCGATTATAATTCCCTACATAGTCCTGATCGTGTCCGCGTACTTGGTGGTCGCTCCGATCATAGACAACCCTCAGTGGGAGTACTTGTACGCGGGAGCTTTCATCCTCGCCGGCCTGCTGGTCTACCTGCCGTTCGTCAAGTGGGGATACTCTCTTCCCTTCATGGATAAAATTACAGTGTTTCTTCAGATGGTTCTAGAAGTGGTGCCAACGTCGACGACTTTTGAATATTGA

Protein sequence:

>DPOGS204403-PA
MSDAMREKLARLLRGSIAAQTHTVTTDKFQNGTTHPVLLYSDEGSAREGGLVWRGCSGDCDAEDGSAGTLADGNANPGDKLEGSDAAPDDPVHLKRRVGLFSGVALIVGTMIGSGIFVSPSGLLERTGSVGISFIIWMACGLLSLLGALAYAELGTMNTSSGAEYAYFMDAFGGPPAFLFSWVSTLVLKPSQMAIICLSFAKYAVEPFVSECEPPDALVKLVALISIVMILAVNCYSVNLATNVQNIFTAAKLVAIAIIVCGGAYKLILGNTRHLQEPNFASSTATLGNIATAFYTGLWAYDGWNNLNYVTEEIKNPSKNLPLSIIIGIPLVTLCYALVNVSYLAVMSVSEMADSEAVAVTFGNRLLGPMAWLMPLAVTISTFGSANGTLFVAGRLCFAASREGHLLDILSYVHVRRFTPAPGLIFHSLIAVAMVLYGTIDSLIDFFSFTAWIFYGGAMLALIVMRYTKPHAPRPYKVPIIIPYIVLIVSAYLVVAPIIDNPQWEYLYAGAFILAGLLVYLPFVKWGYSLPFMDKITVFLQMVLEVVPTSTTFEY-