Monarch geneset OGS2.0

DPOGS209342
TranscriptDPOGS209342-TA951 bp
ProteinDPOGS209342-PA316 aa
Genomic positionDPSCF300336 + 25074-26752
RNAseq coverage5x (Rank: top 88%)
Annotation
HeliconiusHMEL0147902e-4643.28% 
BombyxBGIBMGA008012-TA4e-1429.91% 
Drosophila% 
EBI UniRef50UniRef50_F6WZF05e-4449.34%Uncharacterized protein n=3 Tax=Ornithorhynchus anatinus RepID=F6WZF0_ORNAN
NCBI RefSeqXP_001944491.18e-3036.00%PREDICTED: similar to jerky homolog-like [Acyrthosiphon pisum]
NCBI nr blastpgi|3453259435e-3946.70%PREDICTED: jerky protein homolog-like [Ornithorhynchus anatinus]
NCBI nr blastxgi|3453259432e-3846.90%PREDICTED: jerky protein homolog-like [Ornithorhynchus anatinus]
Group
Gene OntologyGO:00036762.1e-11nucleic acid binding
GO:00036778.6e-11DNA binding
GO:00007758.6e-11chromosome, centromeric region
GO:00055152.9e-10protein binding
GO:00063551.5e-07regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain[129-187] IPR0048752.1e-11DDE superfamily endonuclease, CENP-B-like
[3-53] IPR0066958.6e-11Centromere protein Cenp-B, DNA-binding domain 1
[2-65] IPR0090572.9e-10Homeodomain-like
[10-55] IPR0122871.5e-07Homeodomain-related
Orthology groupMCL23342 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209342-TA
ATGTCTAAGCGTAAAAGAGTAGTGTTGTCTCTTGGAGACAAAATGAAGATAATCGAGGGTATAAAAAATGGTGAATCTGGCAGTAAGTTGGCGCAAATATATGGTGTAGGAGTATCTACGATATCAGACATTAAAAAAAATTCTGCTTCTTTAATGAAATTTACTTATGCTCTTGAGAAAGAAGATGGCAGCTCGCAACGTAAAGTCATGAAAAAACCTAAGAATGAAATTCTAGAAAACGCAGTTTTCACTTGGTTTTTGCAAAAGCGTGCATGCGGCCAGCGAATATCTGGACCTCTGTTATGTGAAAAAGCATTGGATTTTAATAAATGCTTGGTGGTGATACTTCTTTCAAAGCGAGCTGTGGATGAATACGATTTGGATTTTTTAAATGCGGACGAAACAGGTCTTAATTGGAAAGCTTTACCAAGCAGATCATTATCTTCGCGTCGTGAGAATGCAGCACCGGGACACAAAGTAAGTAAGGATAGAGTTACAGTTATGGTTTGCGTGAATGCTAGAGGAACACATCGACTACCACTGCTACTTATTGTAGAAGTGTGGAACGCGGTGGAATCGCAAACACTGAAACGGGCATGGAATAAATTATTAAAATTAAGTCCTTCTGCTAATCCTGTCATGAATCCGCAAGAAGATTATTTTGAAGAAATTACAGAAGCGGTGAAAATTCTTAGCATTGGTGAAGTATGCGATGAAGAAAATATCAAAGAGTGGCTGGACTGCGATAAAATAGTTGACGATTTAAATGGTTGCGAGAATGCAGAAAAAGAAGAGACTGAAACTAAGGATGAACGTCAAGGACCATCTCACGCTGAAGCTTTTGAAGCTCTAGAAATAGCTTTTAAGTGGTTTGAAAGACAGGAGGAATCGGATTCCTTACAGCTGTTACAACTGATGCGCATCAGGGATTTAGCTGCATTGAAAAGATGA

Protein sequence:

>DPOGS209342-PA
MSKRKRVVLSLGDKMKIIEGIKNGESGSKLAQIYGVGVSTISDIKKNSASLMKFTYALEKEDGSSQRKVMKKPKNEILENAVFTWFLQKRACGQRISGPLLCEKALDFNKCLVVILLSKRAVDEYDLDFLNADETGLNWKALPSRSLSSRRENAAPGHKVSKDRVTVMVCVNARGTHRLPLLLIVEVWNAVESQTLKRAWNKLLKLSPSANPVMNPQEDYFEEITEAVKILSIGEVCDEENIKEWLDCDKIVDDLNGCENAEKEETETKDERQGPSHAEAFEALEIAFKWFERQEESDSLQLLQLMRIRDLAALKR-