Monarch geneset OGS2.0

DPOGS203519
Transcript        DPOGS203519-TA  1293 bp
Protein           DPOGS203519-PA  430 aa
Genomic position  DPSCF300055 - 445661-451078
RNAseq coverage   204x (Rank: top 47%)
Annotation
Heliconius      HMEL004105        2e-84   95.71%
Bombyx          BGIBMGA004348-TA  0.0     86.65%
Drosophila      CG7083-PA         6e-125  59.63%
EBI UniRef50    UniRef50_Q9VSH9   8e-123  59.63%  UPF0183 protein CG7083 n=42 Tax=Eumetazoa RepID=U183_DROME
NCBI RefSeq     XP_001659495.1    2e-133  62.13%  hypothetical protein AaeL_AAEL008793 [Aedes aegypti]
NCBI nr blastp  gi|322801280      4e-132  61.66%  hypothetical protein SINV_07382 [Solenopsis invicta]
NCBI nr blastx  gi|340711962      2e-130  62.20%  PREDICTED: UPF0183 protein CG7083-like [Bombus terrestris]
Group
KEGG pathway 
InterPro domain  [52-417]  IPR005373  5.3e-159  Uncharacterised protein family UPF0183
Orthology group  MCL12935  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203519-TA
ATGTCGTTCAACTCCCCGGAGATCACGCCATCTATCGAGCAAGTGGAGCACTGCTTCGGCGCGACTCACCCCGGCCTCTACGACAGCCAGAGACATCTGTTCGCGTTGAATTTCAGAGGCCTGACATTTTATTTCCCCGTCGATAGTAAATTTGAGAATCCACTGTCCGTGGACCTAGTCATAAACATGCCTCAGGACGGGATACGGTTGATATTTGACCCTGTAGCTCAGAGGCTGAAGATCATAGAGATATATAATATGAAATTAGTTAAACTTAGGTATAGCGGCATGTCGTTCAACTCCCCGGAGATCACGCCATCTATCGAGCAAGTGGAGCACTGCTTCGGCGCGACTCACCCCGGCCTCTACGACAGCCAGAGACATCTGTTCGCGTTGAATTTCAGAGGCCTGACATTTTATTTCCCCGTCGATAGTAAATTTGAGCCGGGCTACGCTCACGGCCTCGGCTCTCTCCAGTTCCCTAACGGCGGCTCGCCCGTCGTTTCTCGGACAACTATATACTATGGATCTCAGCATCAGTTGAGCCGGTCAGCGAGCGGTCGCTGTACGGCCCCACTGGCAGAGCTGCCGCTGTCTTGTTATAGACACCAGCTTCATCTGCGACGATGCGACGTCCTGCGCTCACCCTCCGCGACCTTAGGACTCCGCCTGCACATATACACGGAGGGTACTAGATCTGGTGAACCGGCCTCCGCCCGGCGACGCGTGGTTCGTTTCGGAGACAGTTGCCAGGCTGTCTCCAGGGCTCTGGCGGCGCCCGCCCGCCTATACTACAAGGCGGACGATAAGATGCGTATACACAGACCCACCGCCCGCCGCCGCCCACCACCGGCATCAGACTACTTCTTCAATTACTTCACTCTCGGCCTGGACGTTTTATTCGACGCCCGCACGCACCAGGTGAAGAAGTTTATTCTTCACACCAACTACCCCGGCCACTACAACTTCAATATGTACCACAGATGCGAATTCGAACTCAACGTGCAGCCCGACAAATGCGAGTCCAACACACTGGTCGAATCCCGCGGCGCCGTCTGCATCACCGCGTACAGCAAGTGGGAGAACGTGTCGCGAGCGCTGCGGGTCTGCGAGCGGCCGGTCGTCCTCAACAGGGCCTCGTCCACTAACACCACCAACCCCTTCGGCTCCACCTTCTGCTACGGATACCAGGACATGATCTTTGAGGTGATGTCCAACAACTACATAGCGTCAATAACTCTGTATCAACCGGAAGGCACCCGGCCGCACTACGCGGTCACCTCGATCGCGTGA

Protein sequence:

>DPOGS203519-PA
MSFNSPEITPSIEQVEHCFGATHPGLYDSQRHLFALNFRGLTFYFPVDSKFENPLSVDLVINMPQDGIRLIFDPVAQRLKIIEIYNMKLVKLRYSGMSFNSPEITPSIEQVEHCFGATHPGLYDSQRHLFALNFRGLTFYFPVDSKFEPGYAHGLGSLQFPNGGSPVVSRTTIYYGSQHQLSRSASGRCTAPLAELPLSCYRHQLHLRRCDVLRSPSATLGLRLHIYTEGTRSGEPASARRRVVRFGDSCQAVSRALAAPARLYYKADDKMRIHRPTARRRPPPASDYFFNYFTLGLDVLFDARTHQVKKFILHTNYPGHYNFNMYHRCEFELNVQPDKCESNTLVESRGAVCITAYSKWENVSRALRVCERPVVLNRASSTNTTNPFGSTFCYGYQDMIFEVMSNNYIASITLYQPEGTRPHYAVTSIA-
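
The transcript and protein entries above can be cross-checked against each other. The sketch below is one way to do that, not the database's own pipeline: it assumes the transcript is a plain coding sequence translated with the standard genetic code, and that the trailing "-" in the protein sequence marks the stop codon. The names translate and lengths_consistent are hypothetical helpers introduced only for this illustration.

```python
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
# Assumption: the record uses the standard code (NCBI translation table 1).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as stop_symbol to match the record's convention.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    return "".join(residues).replace("*", stop_symbol)


def lengths_consistent(transcript_bp, protein_aa):
    """Check that a transcript holds protein_aa codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


if __name__ == "__main__":
    # First and last codons of DPOGS203519-TA, copied from the record above.
    print(translate("ATGTCGTTCAACTCC"))   # MSFNS
    print(translate("ATCGCGTGA"))         # IA-
    print(lengths_consistent(1293, 430))  # True
```

Passing the full DPOGS203519-TA sequence to translate and comparing the result with DPOGS203519-PA extends the same check to the whole record.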