Monarch geneset OGS2.0

DPOGS202644
TranscriptDPOGS202644-TA1452 bp
ProteinDPOGS202644-PA483 aa
Genomic positionDPSCF300039 - 614391-625549
RNAseq coverage82x (Rank: top 64%)
Annotation
HeliconiusHMEL0056100.072.27% 
BombyxBGIBMGA001299-TA7e-16667.48% 
DrosophilaCG8785-PB2e-12451.41% 
EBI UniRef50UniRef50_Q7K2W33e-12251.41%CG8785, isoform A n=18 Tax=Endopterygota RepID=Q7K2W3_DROME
NCBI RefSeqXP_001861307.14e-12950.00%amino acid transporter [Culex quinquefasciatus]
NCBI nr blastpgi|1700504288e-12850.00%amino acid transporter [Culex quinquefasciatus]
NCBI nr blastxgi|1700504282e-12550.22%amino acid transporter [Culex quinquefasciatus]
Group
KEGG pathway 
InterPro domain[67-468] IPR0130572.6e-69Amino acid transporter, transmembrane
Orthology groupMCL10684 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202644-TA
ATGCAAGTAAATAGACAACGCAGGCTCTTCATCCAAATCGCCACAATGGGAAAAGAAGAACATCTTGACAATTTTAATTCAACAGCAAATTTAACGAAAAACGCCGGTTTTGTGTCCTCAATAAGTTTAGATCCGAAAGACGGCGCTAACAATGAAAAGGAATATAACCCTTTCGAACATCGAAACTTAGCCCACCCTAACTCAACATTTGGGTCGATCATTCATCTTCTCAAAGCATGTCTAGGATCTGGTATTCTAGCTATGCCGGCTGCTTTCAAAAATGCGGGCACTGCGGCCGGGATTGTAGGAACTTTATTGGCAGGATTTATTTGTACTCATGCAGTCCATATACTGGTAAAAACTTCTCAAGAGGCTTGCGTTAATGCTAAGAAGCCTTGTATGAGTTTCTCAGAAACAGTAGGCGCTGCTTTTAAATATGGACCAAAAAGGATGCGACATTTCAGTGGATTTGCCAAGCAATTAATTGACTACTCGCTTTTGATAACGTACTTGAGTGTCCTGATTGTATACGCTGTGTTTATTGGAGTTTCATTTAAAGAGGTTTTGGATGTATACTACCCAGAAGGAAATTTCTCAGTCCAAGTATACTGTATGTTGACACTCGTTCCGTTAGTGCTGATTTGTCAGATAAGGAACCTGAAGTACTTGGTGCCATTTTCAGCACTTGCAAACATAATGATTGCTATAGTTTTTGCTGTCACATTATATTATATGTTCGTGGACTTGCCTCCAGTCAGTGAGAGGGAAGTGGTAGCTAGTATTTCAACCTGGCCGCTGTTTCTCAGCACAGTAATATTTGCCATGGAAGGTATTGGAGTGGTAATGCCTGTTGAGAATGAAATGGCTAACCCAAAGAGATTTCTCGGATGTCCTGGAGTATTAAACATTTCTATGGTGATCGTGATTTCTATGTATTGTATTTTTGGATTTTTTGGGTACATTAAATATGGAGATGCTGTAAAAGGAAGCATAACTCTTAATCTTCCTCAAGACCAATGGGTTGCACAGTTAGCCAAATTACTAATGGCTCTAGTGATGTACTTTTCCTTTGCACTACAATTCTATGTCCCTATGGAAGGAATTCAACGTCTAATGCTGAGTAACTTGCCAGAAAAATATATTAATATTGTTCAAATAAGCATCAGGACTATTTTAGTGTCCATCTGCGTTTGCGTCGCAGCGGCTTTTCCAAATTTGGAGCTAGTGATAAGTCTAGTAGGAGCTTTATTTTTTTCAACCCTCGGATTATTGGTGCCAGCTATTGTCGACACAGTTTACAATTGGGAAAGGAATTTAGGCAAATTCTATTATGTTGCCATCAAAAATTTTATCATTGCTCTCATCGGTGTTATAACTTTAGTATCTGGTTCCTATGTATCTATTGTGGCTATAGTTGAAGACCTATCTAGTAATCATAACGATACTAAATAA

Protein sequence:

>DPOGS202644-PA
MQVNRQRRLFIQIATMGKEEHLDNFNSTANLTKNAGFVSSISLDPKDGANNEKEYNPFEHRNLAHPNSTFGSIIHLLKACLGSGILAMPAAFKNAGTAAGIVGTLLAGFICTHAVHILVKTSQEACVNAKKPCMSFSETVGAAFKYGPKRMRHFSGFAKQLIDYSLLITYLSVLIVYAVFIGVSFKEVLDVYYPEGNFSVQVYCMLTLVPLVLICQIRNLKYLVPFSALANIMIAIVFAVTLYYMFVDLPPVSEREVVASISTWPLFLSTVIFAMEGIGVVMPVENEMANPKRFLGCPGVLNISMVIVISMYCIFGFFGYIKYGDAVKGSITLNLPQDQWVAQLAKLLMALVMYFSFALQFYVPMEGIQRLMLSNLPEKYINIVQISIRTILVSICVCVAAAFPNLELVISLVGALFFSTLGLLVPAIVDTVYNWERNLGKFYYVAIKNFIIALIGVITLVSGSYVSIVAIVEDLSSNHNDTK-