Monarch geneset OGS2.0

DPOGS210509
TranscriptDPOGS210509-TA1443 bp
ProteinDPOGS210509-PA480 aa
Genomic positionDPSCF300186 + 23847-26354
RNAseq coverage132x (Rank: top 56%)
Annotation
HeliconiusHMEL0046329e-4164.83% 
BombyxBGIBMGA012610-TA2e-13356.25% 
DrosophilaEdc3-PB2e-3527.46% 
EBI UniRef50UniRef50_D6WQJ43e-6934.86%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WQJ4_TRICA
NCBI RefSeqXP_974871.15e-7435.11%PREDICTED: similar to AGAP003131-PA [Tribolium castaneum]
NCBI nr blastpgi|910870711e-7235.11%PREDICTED: similar to AGAP003131-PA [Tribolium castaneum]
NCBI nr blastxgi|910870715e-6934.77%PREDICTED: similar to AGAP003131-PA [Tribolium castaneum]
Group
KEGG pathwaytca:6637431e-73 
 K12615 (EDC3)maps-> RNA degradation
InterPro domain[2-75] IPR0210245.8e-19Enhancer of mRNA-decapping protein 3, N-terminal
[179-200] IPR0190501.2e-10DFDF motif
[254-479] IPR0044432.2e-09YjeF-related protein, N-terminal
Orthology groupMCL14133 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210509-TA
ATGTCGAAGTGGATAGGTTACGCCGTGTCAGTAAACTGTGGCGAGCCCCTGGGATGTTATCAGGGTACTATACTGGAAGCCGACGGTAGCACCATCACGTTGACTAAAGCATTTAGAAATGGCTTCCCTTACCCAAAGTCTCAAGTCACACTGAATTCGGCTGATATAAAAGACTTGAAAATAATTGAAGAAGCACGTACAGAGCCATCGGAACAGACGCACAGCACTGTAGCTGTCACAAAGAGCGCCAAGAAAGGCCAAAGAGCCACTGTATGTGAAAATTTGGAGGCAAATCCCTCACATCCCACTGGATCACAGACCTGCAACAAGACGTGCAGCAGCAGGAGCGCACCCTCGGCGCCGCGGAGCAAGCCGATAGACATCCAGGGGCCGAGGATCAATAGGAATACCCATTCAGGCAGCTACGGCAATGCGAGCTCAACTCCTAAGACGCGCCCTCAGCCCGGGGGGGAGCGAGCGAGGAGGAGGAATGAGGCCTGCTTCGGACACGATGCGGACCCCGCGCTAGGAGACGACTTCGACTTCGAGGGAAACCTCGCGCTCTTCGACAAACAGGCGCTGTGGGAGGAGATGAGGACGACCGCCGCCACCAGGCCGGACGTGGTGCGAGCGGCGGACGAGGCAGCGCGGTACAGACACGACGAGAACGTGCTCGGGAGTGCGCCCCCCGCGGACCACATCACTGTCCCCGCGGACAGGAGGGGGCCCGTGGTGTACGCCGCGGACGACGGCCGCCGGGTGCCCTCGGTCACCCTCGATCTGCGCCGAGACTTCTGGCTCGGTCTGCGGCGGCTCGGGCTGCTGGAGGGCGCGCAGGTGCTGCTGGCGCGCGCCGCCGCGGACCTGGCGCTGCGGCTGGCGGGCGGCGGGCGGCGCCTCGAGCCTCGCAATGCCCACCAGGCGCCCGTGGCCGCCGTGCTGGCCGGGGTGCACGACGGCGGTGTGTGCGGACTCGTGGCCGCGAGGATTCTGGCGGCGCACGGCGTTGCGGCGCATGCCTTCCTGTCGGGGACGACGCGCGAGCCGCCCGGAGCGGCGTTTCGGCGTGAGCTAGGCGCGCTGGCGGCGGCGGGGGTGGCGCGGGCGGAGCGACCGGACGAGCTGCCGCCGGCAGACGTTGTGCTCCTGGCGCTGTCCTCGCCGGAAGAGCGCGAGTGTCAAGAGCCACAAGACACGCACGAGGCGGCGCTGGCGTGGGCGAGGGCGGCGCGCTCGGCGTGCGTGGCGCTGGAGCCGCCTGCGGAAGGCTGGCCGGGTGTTTCGTGCCGCGCGTCCGTAGTGGCGGGCCTGCCGGCTGCTCTGTCCCCATCCTTGGGCCGGGTGTATGCGGCCAACGTCGCGGCTCCGGCCCGCCTGTGGCGCGAGTTAGGCGTGTCCTACCGCCCGCCCTTCGGAGCCGCCTCGGTACTGGCGCTGGACTGA

Protein sequence:

>DPOGS210509-PA
MSKWIGYAVSVNCGEPLGCYQGTILEADGSTITLTKAFRNGFPYPKSQVTLNSADIKDLKIIEEARTEPSEQTHSTVAVTKSAKKGQRATVCENLEANPSHPTGSQTCNKTCSSRSAPSAPRSKPIDIQGPRINRNTHSGSYGNASSTPKTRPQPGGERARRRNEACFGHDADPALGDDFDFEGNLALFDKQALWEEMRTTAATRPDVVRAADEAARYRHDENVLGSAPPADHITVPADRRGPVVYAADDGRRVPSVTLDLRRDFWLGLRRLGLLEGAQVLLARAAADLALRLAGGGRRLEPRNAHQAPVAAVLAGVHDGGVCGLVAARILAAHGVAAHAFLSGTTREPPGAAFRRELGALAAAGVARAERPDELPPADVVLLALSSPEERECQEPQDTHEAALAWARAARSACVALEPPAEGWPGVSCRASVVAGLPAALSPSLGRVYAANVAAPARLWRELGVSYRPPFGAASVLALD-