Monarch geneset OGS2.0

DPOGS207844
Transcript: DPOGS207844-TA, 1224 bp
Protein: DPOGS207844-PA, 407 aa
Genomic position: DPSCF300042 + 1183472-1185748
RNAseq coverage: 25x (Rank: top 77%)
Annotation
Heliconius       HMEL015313        1e-147  64.42%
Bombyx           BGIBMGA000582-TA  3e-92   42.71%
Drosophila       CG4928-PB         4e-83   40.62%
EBI UniRef50     UniRef50_E2AUR7   1e-87   43.56%  UNC93-like protein n=3 Tax=Camponotus floridanus RepID=E2AUR7_CAMFO
NCBI RefSeq      XP_001601549.1    1e-101  47.88%  PREDICTED: similar to UNC93A protein, putative [Nasonia vitripennis]
NCBI nr blastp   gi|383855332      1e-98   47.06%  PREDICTED: UNC93-like protein-like [Megachile rotundata]
NCBI nr blastx   gi|345489266      6e-106  49.10%  PREDICTED: UNC93-like protein-like isoform 1 [Nasonia vitripennis]
Group
KEGG pathway 
InterPro domain  [6-394]  IPR016196  6.9e-17  Major facilitator superfamily domain, general substrate transporter
                 [7-83]   IPR010291  3.8e-06  Ion channel regulatory protein, UNC-93
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207844-TA
ATGTTCCTACCTCTTATAGTCATCAAATGGCTCGGAACCAAATGGACAATATCCTTGTCATTCCTAGCTTACCTACCTTACTTTGCCGCACAGATGTACCCTAGTTTTTATACTTTAATTCCAGCGGCTTTCATTATGGGAATAGGTGGAGGACCTCTTTGGTGTGCAAAAAGCACTTATTTGTCTGCCGCAGCCGAGGCTAATACAAAGACTTCTAATCTGTGTTTAGAGGTTTTATTGGTGCGTTTCTTTGGAATATTTTTTATGATCTATCAACTGAATCAAGTTTGGGGAAATCTTATTTCATCGTTAGTTTTATCGTCGGGTGATAATTCAGCAGCTGTTACGGCTATAAATGACACCATGATAGCTCAGCTTTGCGGTGCTAATTTTATGCCAAGTGCACATGCAGATGAAGCATTACAGCGACAACCTCCAGAAAAGATTCAGATGATTTCAGGTATATATTTGGGATGTACAGTTGCTGCATCCCTTTTAGTTGCTGTTGGTGTAGACTCCATTAAGAGTAATAAAATTGATCAGAATAATGCCAGTAAGTCTGGAATACATCTGCTGGCTACCACACTTAAACTACTTGTAGAGCCTAAGCACCTCATGTTAGCCAGCATTAACGTTTTTGTTGGCATGCAACAAGCTTTTTTTGGGGCTGATTTCACTGCCGCGTTTGTCTCTTGTTCTGTTGGCGTTGGAACTGTCGGGTTTGTTATGATGACTTTCGGCTTCGCCAACGCCTCAGGGTGTGTTGTTATGGAGCAAATGGCTAAGACAGTTGGACGTTTACCACTTATTATAGCGGCTTTTATTATTCATGGTTCACTAATGGTGACACTCCTCACATTTAACCTCCAACCAAATCAGCCTGTAGTCATGTATGTCATTGCATGTTTGTGGGGATTCTGCGATTCTATTTGGGCGGTGCAAATTAGCGCGTTCTATGGAATAGTTTTTAAAGGCAGAGAGGAAGCGGCGTTTTCCAACGTAAGACTCTGTGAGTCATTTGGCTATATTATTGCTTACATAATTTCTCCACACCTGAAGACTGGTGTAAAGACTTATATCCTAATGGTGACTATGCTGGTTGGAGTGGTGCTATACATCATAGTGGAATTTAGTGAAAGGAAAGCCAATACATCGACCGAAACATCAGAACCACAAGAGAAATATATTGATTTTGATAACAAAGCTTTCGAATATTCTAAGTAA

Protein sequence:

>DPOGS207844-PA
MFLPLIVIKWLGTKWTISLSFLAYLPYFAAQMYPSFYTLIPAAFIMGIGGGPLWCAKSTYLSAAAEANTKTSNLCLEVLLVRFFGIFFMIYQLNQVWGNLISSLVLSSGDNSAAVTAINDTMIAQLCGANFMPSAHADEALQRQPPEKIQMISGIYLGCTVAASLLVAVGVDSIKSNKIDQNNASKSGIHLLATTLKLLVEPKHLMLASINVFVGMQQAFFGADFTAAFVSCSVGVGTVGFVMMTFGFANASGCVVMEQMAKTVGRLPLIIAAFIIHGSLMVTLLTFNLQPNQPVVMYVIACLWGFCDSIWAVQISAFYGIVFKGREEAAFSNVRLCESFGYIIAYIISPHLKTGVKTYILMVTMLVGVVLYIIVEFSERKANTSTETSEPQEKYIDFDNKAFEYSK-
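
The transcript and protein records above can be checked against each other. The sketch below is a minimal illustration, not part of the database record. It assumes the transcript is entirely coding sequence and is translated with the standard genetic code, and it writes the stop codon as "-" to match the trailing character of the protein record. The names `translate` and `expected_protein_length` are hypothetical helpers introduced here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as `stop_symbol`, assumed here to be "-"
    to mirror the convention in the protein record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp: int) -> int:
    """Residue count for a fully coding transcript, excluding the stop codon."""
    return transcript_bp // 3 - 1


# First ten and last five codons of DPOGS207844-TA, copied from the record.
print(translate("ATGTTCCTACCTCTTATAGTCATCAAATGG"))  # MFLPLIVIKW
print(translate("GAATATTCTAAGTAA"))                 # EYSK-
print(expected_protein_length(1224))                # 407
```

The two fragments reproduce the start (MFLPLIVIKW) and end (EYSK-) of DPOGS207844-PA, and 1224 bp corresponds to 407 residues plus one stop codon, which agrees with the lengths listed for the transcript and protein.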