Monarch geneset OGS2.0

DPOGS213053
TranscriptDPOGS213053-TA861 bp
ProteinDPOGS213053-PA286 aa
Genomic positionDPSCF300016 - 1314062-1315899
RNAseq coverage191x (Rank: top 48%)
Annotation
HeliconiusHMEL0150812e-11270.90% 
BombyxBGIBMGA007669-TA7e-11172.16% 
DrosophilaCG1142-PA6e-3139.89% 
EBI UniRef50UniRef50_UPI00021A67B13e-5143.51%UPI00021A67B1 related cluster n=4 Tax=unknown RepID=UPI00021A67B1
NCBI RefSeqXP_969663.22e-4842.15%PREDICTED: similar to Deoxynucleotidyltransferase terminal-interacting protein 2 (Terminal deoxynucleotidyltransferase-interacting factor 2) (TdT-interacting factor 2) (Estrogen receptor-binding protein) (LPTS-interacting protein 2) (LPTS-RP2) [Tribolium castaneum]
NCBI nr blastpgi|3407115111e-5043.51%PREDICTED: deoxynucleotidyltransferase terminal-interacting protein 2-like [Bombus terrestris]
NCBI nr blastxgi|3800300683e-5442.80%PREDICTED: deoxynucleotidyltransferase terminal-interacting protein 2-like [Apis florea]
Group
KEGG pathway 
InterPro domain[151-246] IPR0148107.4e-29Fcf2 pre-rRNA processing
Orthology groupMCL15075 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213053-TA
ATGGATTTTATAGTAGACACCGTGGGTGATGATGCACTCCAGAATAAAACAGAATTATTAGTTTCTGAGGACAAATTGTTTATAGAAAAAGAGAAGAAAACTGAAGCTATTTTGCAATTATTGTCCGATAAAAAGCAAGAATTAGATAGTAAAATAAATGAGGAGGAAGAAATCCAAAACAAACTTAAAGGCCTCAAAATAGATAATTTGTTTGATGAATTTTTTGATGATATGAGATGGACAAACTCTTTATTGATAAGAAGGAAAAGGAATAGAAAGTTTCAATTCGATCAATTAGACAAAGAGACTGGCAAGTTGGCATCTAACCTAGGAGGTGTTGATGTTGAGAAGGAAATGCAGAAGTCTGTGTTAAAACCGGGATTAGAAAAAGAACACACATTACCCAAATACAATATTAGTGACAAAGAATTGAGGGCTACAAGAAAGCAAGAGAGACAAAACACTAAAGGACCTGCTTGGTTCAATATGCGCGCTCCAGAAGTTTCTGAAGATCTGAAGAACGACCTGCAAGTGCTAAAGATGAGGTCGGCCCTGGACCCCAAACACTTTTATAAAAAGAATGATATGGAAGTATTGCCAAAATATTTTCAAGTTGGTCGTATCTTGGATTCTCCTCTGGACCATGTAAATGAAAGGGTAACGAGGAAGAATAGAAAAAGAACAATGGTCGAAGAACTGTTAGCCGATGCGGACTTTCAGAAGTATAATAAAAAGAAATACAAAGAGATCATAGACGAGAAGCGAAAGACAGAATACAGAACAGTCATGAGGGACAAGCGACAGAAGAGTAAAGCGGCACATAAAAGTAACAAGTTAAAAGCAAATAAGACTGCCAAATAA

Protein sequence:

>DPOGS213053-PA
MDFIVDTVGDDALQNKTELLVSEDKLFIEKEKKTEAILQLLSDKKQELDSKINEEEEIQNKLKGLKIDNLFDEFFDDMRWTNSLLIRRKRNRKFQFDQLDKETGKLASNLGGVDVEKEMQKSVLKPGLEKEHTLPKYNISDKELRATRKQERQNTKGPAWFNMRAPEVSEDLKNDLQVLKMRSALDPKHFYKKNDMEVLPKYFQVGRILDSPLDHVNERVTRKNRKRTMVEELLADADFQKYNKKKYKEIIDEKRKTEYRTVMRDKRQKSKAAHKSNKLKANKTAK-