Monarch geneset OGS2.0

DPOGS202676
TranscriptDPOGS202676-TA1443 bp
ProteinDPOGS202676-PA480 aa
Genomic positionDPSCF300039 + 705361-709850
RNAseq coverage261x (Rank: top 41%)
Annotation
HeliconiusHMEL0080690.087.89% 
BombyxBGIBMGA000844-TA0.075.68% 
DrosophilaCG8785-PB7e-13854.17% 
EBI UniRef50UniRef50_Q7K2W31e-13554.17%CG8785, isoform A n=18 Tax=Endopterygota RepID=Q7K2W3_DROME
NCBI RefSeqXP_001861307.19e-14454.23%amino acid transporter [Culex quinquefasciatus]
NCBI nr blastpgi|1700504282e-14254.23%amino acid transporter [Culex quinquefasciatus]
NCBI nr blastxgi|1700504281e-13954.23%amino acid transporter [Culex quinquefasciatus]
Group
KEGG pathway 
InterPro domain[56-457] IPR0130579.5e-69Amino acid transporter, transmembrane
Orthology groupMCL10684 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202676-TA
ATGGGTCACGACGAGAAGAAGGGCACCATTATTTTAGACAACTTCAACTCGACAGCTAATTTAACAGCTAATCCTGGATTCCAGTCTACTCTGAGCCTCGGTTCAAAAGATGTTATTAATGAGAAAGCGTACAATCCTTTTGAACACAGAAAAGTGTCTCATCCAAATTCAACTATTGGCTCCCTCGTACATTTATTAAAATCATCACTTGGCTCTGGTATCCTGGCGATGCCGGCAGCTTTCAAAAATGCTGGATTAGCCGTTGGAGCTTTCGGAACGATTATTATTGGTTTTATTTGTACGCACTGCGTGTATGTTCTTGTTAAAACATCTCAGGAAGTTTGTGTGGAAGCGAAAAAGCCGTCAATGGGATTCGCGGAGACCTGCGGAGCGGCTTTCGAATTCGGACCAAAAAAACTAAGGCCTTGGGCGAATTTTGCAAGAACTTTCATCGATTACACATTAACTTGTACTTATCTGGCTGCTTTGTGTGTTTACGTTGTGTTCATCGCTGAGAACTTTAAAGAAGTTCTTGATGAATATTATCCCGAATACAAACTCTCCGTGGAGGCGTACTGTGCGCTGACCCTTGTTCCTCTCGTTCTAATCTGTCAAATAAGGAATTTGAAATGGCTGGTGCCATTTTCTGCTGTAGCAAACATCTTCTTAGTGATCTGCTTTGCCATTACTATGTATTACATATTCGATGATTTGCCCAATCCTGCCGAAAGGCAAATGGTTGCTAGTTTTACGCAGTGGCCATTATTTATAAGTACCGTTATCTTCGCTATGGAAGGCATCGGAGTGGTGATGCCAGTTGAAAACGAGATGGCGAAACCACAACAGTTTCTGGGATGCCCTGGAGTTCTTAACGTCGCCATGACAATCGTAATTTCCTTGTACGGTATTGTCGGTTTCTTTGGATATATCAAGTATGGAGACACTGTACGCGGAAGTGTTACATTAAATCTCCCACAAGACGAAATTTTGGCCCAAAGCGCGAAGATCTTGATGGCTCTCGCTATTCTATTTACTTATAGTCTACAGTTCTACGTGCCAATGGAAATGATCTGGCGTGAATTGCACTCTAAGATCTCTATAAAATACCACAACTTCATGCAAATTACTATCAGAACTACCGCTGTCGTAGGATCTGTTGCTATTGCTGCCGCCTTTCCTGATTTGGAGCTATTCATCAACTTAAGCGGAGCTGTATTCCTCTCAAGTCTTGGACTTTTGACGCCAGCAATAGTAGACACAGTTCACAACTGGAATCGGGGTCTGGGAAAATACAATTGGATTTTATGGAAAAATATCCTGGTAATGATGTTATCGTTCATTGCTCTGTTCGCTGGATCTTACGTATCAATTGTTGGCATAGTCGAAAAATACAATACCACACACAATTTGGAATCCAATATGAACTCAACTCTAAGAACATAA

Protein sequence:

>DPOGS202676-PA
MGHDEKKGTIILDNFNSTANLTANPGFQSTLSLGSKDVINEKAYNPFEHRKVSHPNSTIGSLVHLLKSSLGSGILAMPAAFKNAGLAVGAFGTIIIGFICTHCVYVLVKTSQEVCVEAKKPSMGFAETCGAAFEFGPKKLRPWANFARTFIDYTLTCTYLAALCVYVVFIAENFKEVLDEYYPEYKLSVEAYCALTLVPLVLICQIRNLKWLVPFSAVANIFLVICFAITMYYIFDDLPNPAERQMVASFTQWPLFISTVIFAMEGIGVVMPVENEMAKPQQFLGCPGVLNVAMTIVISLYGIVGFFGYIKYGDTVRGSVTLNLPQDEILAQSAKILMALAILFTYSLQFYVPMEMIWRELHSKISIKYHNFMQITIRTTAVVGSVAIAAAFPDLELFINLSGAVFLSSLGLLTPAIVDTVHNWNRGLGKYNWILWKNILVMMLSFIALFAGSYVSIVGIVEKYNTTHNLESNMNSTLRT-