Monarch geneset OGS2.0

DPOGS207843
Transcript: DPOGS207843-TA | 1440 bp
Protein: DPOGS207843-PA | 479 aa
Genomic position: DPSCF300042 + 1169033-1173339
RNAseq coverage: 635x (Rank: top 20%)
Annotation
Heliconius: HMEL015313 | 0.0 | 75.51%
Bombyx: BGIBMGA005524-TA | 3e-122 | 46.28%
Drosophila: CG2121-PB | 5e-118 | 44.80%
EBI UniRef50: UniRef50_E2AUR7 | 7e-125 | 50.11% | UNC93-like protein n=3 Tax=Camponotus floridanus RepID=E2AUR7_CAMFO
NCBI RefSeq: XP_001601549.1 | 1e-141 | 54.31% | PREDICTED: similar to UNC93A protein, putative [Nasonia vitripennis]
NCBI nr blastp: gi|345489266 | 4e-137 | 54.17% | PREDICTED: UNC93-like protein-like isoform 1 [Nasonia vitripennis]
NCBI nr blastx: gi|345489266 | 5e-145 | 54.39% | PREDICTED: UNC93-like protein-like isoform 1 [Nasonia vitripennis]
Group:
KEGG pathway:
InterPro domain: [1-462] | IPR016196 | 2.5e-21 | Major facilitator superfamily domain, general substrate transporter
                 [24-141] | IPR010291 | 2e-08 | Ion channel regulatory protein, UNC-93
Orthology group: MCL16453 | Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207843-TA
ATGAAGGATCCGCAAGTTGAATACAAACCGAGGGAAACATGGAGAATAATGAAGAATGCTGTGATTATCAGCATAGCTTTCATGGTGCACTTCACAGCCTATAGTGGTGCAGCAAATCTTCAAAGCTCGATCAACGCGGAGTCCGGTCTTGGGACAGTTTCCTTGGCGGCGGTGTACGCTGGTCTTATATTTTCCAATATATTCCTCCCCGTTGTTATTATCAAATGGCTGGGCACAAAATGGGCGATATCTCTATCCTTTATAACTTATATGCCGTACATAGCTGCACAGTTGTATCCAACGTTCTATACCCTCATACCGGCCGCTTTGATCGTTGGTTTGGGAGGTGGGCCTCTATGGTGCGCTAAGTGCACTTACTTGTCTGTGATATCAGAGGCACATAGCACGATATCCGACATTTCGCCGGAGGTGTTATTAGTCCGATTTCTTGGCTTGTTTTTCATGATCTTCCAGTTCAATCAAGTTTGGGGGAATCTTATTTCATCTTTAGTGATATCATCGGGTGACAATGTGGCAGCTGTGACAACAGTCAACGATTCTTTCATACCTCAGTTGTGTGGAGGCAATTTCCTTCCCACCAAAGACGCGGGACAGGCTCTTCAACAACAGCCTCCAGAGAAGATACAGATGATAGCTGGCATTTATTTGGGGTGTATGGCCGCTGCCGCACTCATTGTCGCCGTCGGCGTGGACTCAATGAAAAGGTATAAAACAAGTCGTAGCCAGACCGGTTCCAGTCTCTCGGGGATGGCGCTCCTCGCTGTCACGGTCAAGTTACTGGTGGAGCCTAACCAGCTCATGCTGGTCATTATTAACATCTTCGTGGGCATGCAGCAGGCTTTCTTCGGGGCTGATTTCACTGCGGCGTTCGTGTCATGTGCCGTTGGCACTGGAACTGTGGGTTTTGTGATGGTGGCCTATGGGTTAGCTGACGCTATAGGATGTGTCGTGACTGGATATTTAGCGAAGTTAACTGGTCGAATGCCTCTGATCGGTCTGGCGACGGTCCTGCACAGCCTCCTATTAATGTCGCTGCTGGCATGGAGTCCTCAACAGCACCAAGCATACATCATGTACATCATCGCTGTTTTGTGGGGATTTTGTGATTCCATATGGCTAGTGCAGATCAATGCCTACTACGGAATCCTCTTCAAAGGTAGAGAAGAAGCCGCGTTTTCTAACTTCAGACTTTGGGAGTCAGTGGGCTACATCATAGCATACATCATATCACCGTTCTTGAAAACCAGTATTAAGACTTACATATTGATAGTAGCTATGATCGTAGGAGTAATCTTCTACTTCATAGTGGAATATAGAGACAGAAAGGCAAAAAGAGTGATAGAAATAGAAGAAAAGAAAACAAGGAAAGCGAAAAGTTTGGCCGGTCAAGAAAATACAGCTTATGTTCAAGGAGAATAA

Protein sequence:

>DPOGS207843-PA
MKDPQVEYKPRETWRIMKNAVIISIAFMVHFTAYSGAANLQSSINAESGLGTVSLAAVYAGLIFSNIFLPVVIIKWLGTKWAISLSFITYMPYIAAQLYPTFYTLIPAALIVGLGGGPLWCAKCTYLSVISEAHSTISDISPEVLLVRFLGLFFMIFQFNQVWGNLISSLVISSGDNVAAVTTVNDSFIPQLCGGNFLPTKDAGQALQQQPPEKIQMIAGIYLGCMAAAALIVAVGVDSMKRYKTSRSQTGSSLSGMALLAVTVKLLVEPNQLMLVIINIFVGMQQAFFGADFTAAFVSCAVGTGTVGFVMVAYGLADAIGCVVTGYLAKLTGRMPLIGLATVLHSLLLMSLLAWSPQQHQAYIMYIIAVLWGFCDSIWLVQINAYYGILFKGREEAAFSNFRLWESVGYIIAYIISPFLKTSIKTYILIVAMIVGVIFYFIVEYRDRKAKRVIEIEEKKTRKAKSLAGQENTAYVQGE-
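
The transcript and protein entries above can be cross-checked against each other: a 1440 bp coding sequence holds 480 codons, which corresponds to 479 residues plus one stop codon. The following is a minimal sketch of such a check, assuming the standard genetic code and assuming that the trailing "-" in the protein record marks the stop codon. The function names `translate` and `lengths_consistent` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1.

    Stop codons are written as stop_symbol; "-" is assumed here to match
    the convention seen in the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript holds exactly protein_aa codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First 30 nt and last 9 nt of DPOGS207843-TA, copied from the record above.
print(translate("ATGAAGGATCCGCAAGTTGAATACAAACCG"))  # MKDPQVEYKP
print(translate("GGAGAATAA"))                       # GE-
print(lengths_consistent(1440, 479))                # True
```

Passing the full nucleotide sequence to `translate` in place of the short fragments would allow a residue-by-residue comparison with the DPOGS207843-PA record.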