Monarch geneset OGS2.0

DPOGS202641
TranscriptDPOGS202641-TA1359 bp
ProteinDPOGS202641-PA452 aa
Genomic positionDPSCF300039 - 712705-715568
RNAseq coverage21x (Rank: top 79%)
Annotation
HeliconiusHMEL0078530.070.80% 
BombyxBGIBMGA014209-TA2e-13077.19% 
DrosophilaCG8785-PB1e-10445.27% 
EBI UniRef50UniRef50_Q7K2W32e-10245.27%CG8785, isoform A n=18 Tax=Endopterygota RepID=Q7K2W3_DROME
NCBI RefSeqXP_001648128.12e-10645.77%amino acid transporter [Aedes aegypti]
NCBI nr blastpgi|2897405532e-10646.82%amino acid transporter protein [Glossina morsitans morsitans]
NCBI nr blastxgi|2897405533e-10547.20%amino acid transporter protein [Glossina morsitans morsitans]
Group
KEGG pathway 
InterPro domain[45-445] IPR0130571.3e-66Amino acid transporter, transmembrane
Orthology groupMCL22353 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202641-TA
ATGAATGAAACACAAACAATAACGAGTACTCAAAGTGTTCTGAGAGATATAAGGACAGAGATAACAGATAATGATGCAAAGGAAGATGTTAATCGTTATGTACCAGCCGAACATAGGCCGCGGGAGTCAAACACATCAAGTTTCGGAGCGCTGGCCCATCTTCTTAAAGCTTCTTTGAGTTCAGGTGTTTTAGCGATGCCAGTGGCGTTCAAGAATGCTGGACTCATCACTGGAATTATTGGAACAATATTTGTTGGTCTGATTTGCGTTCACGTAACTCATATATTTGTAAAAACATCACAAGCTCTATGTGTAGATATTAAAAGACCATGTTTAGGATATTCTGAAACTTGCTATTCAGTTTTCAAAAATGGACCAAAATCGGTCCAAAAATTTGCCTCTATAGCCAGATTTCTAGCAGATTGTTCTCTGGCTGTCACACATTTAGGAGCTTGCTGTGTATATATCGTGGTTGTTGCTGAGAGTTTTAAACAAGTTTCTGATGAGTATTGTGGTCCATCGTGGTCAGTATCGGCATTCTGTGCTCTGACCCTGATTGTGTTAATACCGCTCACACAAATCACGAAACTGAAGTACTTGGTCCCATTCTCAACATTTGCAAATTTTGTATGGCTCACCTCTATTTGTATATCGTTATATTACTGTTTGCGAAAATCACAACCGCTTTCGAAACGGAATTTATCTACATCTTTCTCTGGATTCGTTAACTTTATAAGCACAAGTTTATTTGCTATGGAAGGCATTGGAGTGGTGATGCCGATTGAAAATGAAATGTTGAAGCCGAATCAATTCCTCGGATGTCCGGGAGTCTTAACAATAGCTATGAGTGCTGTGGTCGCTTTATTTGCTTTTGTCGGATTCACAGGATATTTAAGTTTCGGTGAAGACGTAAGGGGTAGTTTGACACTCAATCTGCCTCATGATGAAATTTTAGCACAAGTAGCAAAGATTTTAGTTGCTTGTGTTATGTTACTCTCCTACGCATTAATATTCTACGTGCCTTTAGAAATTCTTTGGAAGAGGATAAAAAATAAATTTCATGAAAATAATCATAGGATTTGTGTTGCTTGTATAAGGTTGGCGGGTACAGTTTTCACCGTGGGCCTTGCCTGTGCGATACCTAGACTAGAACTCTTTATGGAGCTGGTCGGAGCTGTATGTTTATCGATCCTGGGTATTACATTTCCTGTTATTATTGAGACTGTTTTCCTCTGGGACAAAGATATGGGGAAATGGAAATGGATCTTATGGAAAAATACTTTTATCTTAATATTTTCGATTTTGGTTTTAATATCTGGAATATCGTGTTCTATTCAGACCTTGTTTCAAAAATTGTGA

Protein sequence:

>DPOGS202641-PA
MNETQTITSTQSVLRDIRTEITDNDAKEDVNRYVPAEHRPRESNTSSFGALAHLLKASLSSGVLAMPVAFKNAGLITGIIGTIFVGLICVHVTHIFVKTSQALCVDIKRPCLGYSETCYSVFKNGPKSVQKFASIARFLADCSLAVTHLGACCVYIVVVAESFKQVSDEYCGPSWSVSAFCALTLIVLIPLTQITKLKYLVPFSTFANFVWLTSICISLYYCLRKSQPLSKRNLSTSFSGFVNFISTSLFAMEGIGVVMPIENEMLKPNQFLGCPGVLTIAMSAVVALFAFVGFTGYLSFGEDVRGSLTLNLPHDEILAQVAKILVACVMLLSYALIFYVPLEILWKRIKNKFHENNHRICVACIRLAGTVFTVGLACAIPRLELFMELVGAVCLSILGITFPVIIETVFLWDKDMGKWKWILWKNTFILIFSILVLISGISCSIQTLFQKL-