Monarch geneset OGS2.0

DPOGS214209
TranscriptDPOGS214209-TA1200 bp
ProteinDPOGS214209-PA399 aa
Genomic positionDPSCF300014 + 318593-320473
RNAseq coverage146x (Rank: top 54%)
Annotation
HeliconiusHMEL0068033e-16767.92% 
BombyxBGIBMGA005936-TA1e-15259.63% 
DrosophilaCG31759-PC2e-8438.91% 
EBI UniRef50UniRef50_B0XJI22e-10344.52%2-phosphodiesterase n=3 Tax=Culicidae RepID=B0XJI2_CULQU
NCBI RefSeqXP_972708.11e-11448.74%PREDICTED: similar to 2-phosphodiesterase [Tribolium castaneum]
NCBI nr blastpgi|910822333e-11348.74%PREDICTED: similar to 2-phosphodiesterase [Tribolium castaneum]
NCBI nr blastxgi|910822333e-11248.16%PREDICTED: similar to 2-phosphodiesterase [Tribolium castaneum]
Group
KEGG pathwayppp:PHYPADRAFT_1068172e-33 
 K12603 (CNOT6, CCR4)maps-> RNA degradation
InterPro domain[110-398] IPR0051357.1e-22Endonuclease/exonuclease/phosphatase
Orthology groupMCL15602 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214209-TA
ATGGCAGATTTCCCAGTCTTTCCTGAGAATCTGGAGACTCTGTATGCAGAAAAAGAACTCAATATATATAACTGGTATAGAGGAAAAGTTAATAATAGCAAAGGTAATGAACTAAGTGATGTACATGTTGAATGGAACTTTATTGTAAGTAGTTTTTCCTACACACCTAAATCAGAAGATATTGGACTAAAACTAAAATTAGAATGTATTCCAGTGAATGCAAAACTTAGCGGACCTGTTGTAGAGTGTATCTCAAAAAATTTAGTTGAAGCTGGACCTGGAAGCTGTCCATTTGAAACTAGACATATGTTTACACCTACAAAGTTAAACGGCAAAAGATTTAGATGTGTGTCATACAATATTTTAGCAGATTTATACTGTGACTCAGATTATACTAGAACCGTACTACACCCATACTGTCCACCATATGCATTGCAAATTGATTATAGAAAACAGCTCATTATGAAAGAATTAAAAGGTTACAATGCTGATATAATTTGTCTGCAGGAAGTTGATGGCAAGATATTCAATAAATGTCTTAAACCCTTTTTGGATAGTGACAATTTCAATGGGTTATTTTATAAAAAAGGAAAAACTGTTGCGGAAGGTTTAGCTTGTTTTTACAACAGGCTCAGATTTTGTTTAATAGAAGACTTTCACATTTTATTAGCCAAAGTGTTAGAAAAAGAGAGTTATCTCAAAAATATCTTTGATATAATAAAAAATAACACTGCTTTGATGGAAAGGTTACTTGATAGGTCAAGTGTTGCCAGTGCTACTGTACTACAGTCAATTGAAAACCCTAATGAAATTCTTGTTAATCCAGGTAAACGGATAAGTGTAATTTTGTGTGGAGATTACAATAGTGTCCCCTCATGTGGAATATATCAGTTGTTCACTACAGGCTTAGCACCAAGTTCTTTGGAGGATTGGAAATCAAATGCAAATGAAGCTGTGCATGATCTTACTTTGTCTCAAGATATTTTACTTGATAGTGCTTGCGGAACACCAAAATATACAAACTTTACACAAGGATTTGCTGAATGCATAGACTACATTTTTTATGAAAAGAACAATCTTTCTGTTAATCAGGTGATTCCACTTCCAAATGAAGAAGAGTTGAAAGCTCACATAGCATTGCCCAGTGTTGTATTTCCATCAGATCATATAGCACTTGTATCAGATTTAGAATTTAAATAA

Protein sequence:

>DPOGS214209-PA
MADFPVFPENLETLYAEKELNIYNWYRGKVNNSKGNELSDVHVEWNFIVSSFSYTPKSEDIGLKLKLECIPVNAKLSGPVVECISKNLVEAGPGSCPFETRHMFTPTKLNGKRFRCVSYNILADLYCDSDYTRTVLHPYCPPYALQIDYRKQLIMKELKGYNADIICLQEVDGKIFNKCLKPFLDSDNFNGLFYKKGKTVAEGLACFYNRLRFCLIEDFHILLAKVLEKESYLKNIFDIIKNNTALMERLLDRSSVASATVLQSIENPNEILVNPGKRISVILCGDYNSVPSCGIYQLFTTGLAPSSLEDWKSNANEAVHDLTLSQDILLDSACGTPKYTNFTQGFAECIDYIFYEKNNLSVNQVIPLPNEEELKAHIALPSVVFPSDHIALVSDLEFK-