Monarch geneset OGS2.0

DPOGS202035
TranscriptDPOGS202035-TA672 bp
ProteinDPOGS202035-PA223 aa
Genomic positionDPSCF300053 - 107847-108805
RNAseq coverage476x (Rank: top 26%)
Annotation
HeliconiusHMEL0099373e-11084.30% 
BombyxBGIBMGA014605-TA5e-9076.35% 
DrosophilaCG31249-PA1e-3438.31% 
EBI UniRef50UniRef50_Q9Y2241e-4742.45%UPF0568 protein C14orf166 n=40 Tax=Euteleostomi RepID=CN166_HUMAN
NCBI RefSeqXP_001663377.12e-4842.17%cle7 [Aedes aegypti]
NCBI nr blastpgi|3070951502e-4943.50%hypothetical conserved protein [Triatoma matogrossensis]
NCBI nr blastxgi|1195860645e-4844.10%chromosome 14 open reading frame 166, isoform CRA_b [Homo sapiens]
Group
KEGG pathwaycfa:4842719e-25 
 K04257 (OLFR)maps-> Olfactory transduction
InterPro domain[1-223] IPR0192651.3e-77Protein of unknown function UPF0568
Orthology groupMCL12586 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202035-TA
ATGTTCAAGCTGAAATTAAAAGCACTTGGTCATCCAAATCCCGAAAATTTTAATTGTGAAGATGAAAAGGAATATAGAAGTATCGTTCTTTGGTTAGAAGACCAAAAAATCAGGCATTACAAAATAGAAGAAAGGGAAGGCCTGCGAAATATTGACAGTGACTCTTGGAAAGAAGCATATGACACATATCAGAAAGACCTAGTTAGTCCAATAAATAGTGGAGATCCAAATGAACAATTAAATTGGCTTCTCTCCTACGCTGTGAGGCTCGAATATGGAGACAACGTTACTAAATATAAGGATGTCAAAGTTGAACAACCTAAACAAGCAACACCTAATGTGGTGTCTTCGAACCCATTGGACAATCTTGACTTTGGCAGTTCTGCATTTGTTGCTGGTGTCGATAGAATATGTGCATTAACCGGTGTAGGTCCACATCCCAATCCGAAATTCAGACTATCAGCAGTAGCAAAGATATTAAAAACTGCACCTCATCCCGACCATAACAAAGGTGATGCAAGTATTGTGCAACAACCCAACGATGTGTTGAAACTCTTGTTTATACAGGACTTGAGGGAACTTCAAACTAAAATCAATGAAGCTCTAGTTGCAGTACAATCTGTAACAGCCGACCCTCGCACTGACACCACATTGGGCAGAGTTGGGAGATAA

Protein sequence:

>DPOGS202035-PA
MFKLKLKALGHPNPENFNCEDEKEYRSIVLWLEDQKIRHYKIEEREGLRNIDSDSWKEAYDTYQKDLVSPINSGDPNEQLNWLLSYAVRLEYGDNVTKYKDVKVEQPKQATPNVVSSNPLDNLDFGSSAFVAGVDRICALTGVGPHPNPKFRLSAVAKILKTAPHPDHNKGDASIVQQPNDVLKLLFIQDLRELQTKINEALVAVQSVTADPRTDTTLGRVGR-