Monarch geneset OGS2.0

DPOGS212137
Transcript         DPOGS212137-TA   1932 bp
Protein            DPOGS212137-PA   643 aa
Genomic position   DPSCF300038 + 120458-123064
RNAseq coverage    1086x (Rank: top 12%)
Annotation
Heliconius         HMEL004991         0.0      85.95%
Bombyx             BGIBMGA006588-TA   9e-115   65.56%
Drosophila         CG4364-PA          1e-180   50.25%
EBI UniRef50       UniRef50_Q9VL96    2e-178   50.25%   Pescadillo homolog n=14 Tax=Endopterygota RepID=PESC_DROME
NCBI RefSeq        XP_001119862.1     0.0      62.61%   PREDICTED: similar to CG4364-PA [Apis mellifera]
NCBI nr blastp     gi|383865544       0.0      64.98%   PREDICTED: pescadillo homolog [Megachile rotundata]
NCBI nr blastx     gi|383865544       0.0      64.26%   PREDICTED: pescadillo homolog [Megachile rotundata]
Group
Gene Ontology      GO:0008283   9.8e-113   cell proliferation
                   GO:0005730   9.8e-113   nucleolus
                   GO:0005622   8.6e-18    intracellular
KEGG pathway 
InterPro domain    [6-277]     IPR010613   9.8e-113   Pescadillo, N-terminal
                   [322-419]   IPR001357   8.6e-18    BRCT
Orthology group    MCL13651    Single-copy universal gene

Nucleotide sequence:

>DPOGS212137-TA
ATGGTGTCTAAAAGAAAGAAGAAGTATTCGACCGGAGAAGGAGCTCAGTTTATGACCAGAAAGGCAGCTTTAAAAAAGCTTCAATTGTCCTTAAAAGATTTTAGACGAATATGCATCCTGAAAGGTATTTACCCCCGAGAACCCCGAAACCGGAAAAGGGCACAGAAAGGTGCTGGAGGCATCAAAACGTTATATCACACAAAGGATATCAAATTCTTACTACATGAACCAATTATATGGAAACTAAGGGAACTGAAAGTTTTCCAACAAAAAATTAGAAAAGCTCGTGCTATGCGAGAATATGGAAAAATGAGAAAATATTTCAGTGATTACCCAGAAATCAATATAGATCACATTGTTAAGGAGAGATATCCTACATTTGTAGATGCATTAAGGGATTTGGATGACTGTTTGACACTCTGCTTTTTGTTCAGCACTTTCCCCTCTTTAAAAAGGGTCCCCAGAGACCAATCTCTCCTGTGCAGAAGGTTAACTGTTGAGTTTATGCATGCCATCATAGCAGCAAAGGCTCTCCGTAAAGTTTTTGTTTCTGTCAAAGGATATTATTACCAAGCAGAAATTGAAGGACAAACTATTACCTGGATTGTTCCTCATCATTTCTCATTTAAACCACAAAATAAAGATGAAGTAGATTTTAAAATCATGTCTACATTTGTGGAATTCTATATAATGGTGTTGGGATTTGTTAATTTCAAACTATTCCACTCCTTGAACCTAGTATATCCACCAAAATTAACCGCTGGGCTGAACTCAGATGCAGAAAAAGATTTAGTAGACGAAAAGGCTTATGTAGCAGAAAGAGTTGCTGCTATGAACATGTCAATAGCTCGTATTGCTGGTTCCAATGAAGCAGAGGAATTACCAGATATTGATGTCTTTAACACAGAAGACAATGACCCAGAGAAAATGGAGGAAGCTAAGAAAGAAGCTGAGAAGATTAAAGTACTCAAGACCATGTTTAAAGGACTGAAGTTCTTCATAAATCGGGAAGTACCAAGGGAACCTCTGGTGTTCATAATACGTTGCTTTGGTGGCGAAGTCTCTTGGGACAGAGACCATTTTGTTGGAGCTACTTTTGATGAATCTGATGAGACAATTGCATATCAAATAGTTGATAGACCATCTATGGATAAGCAATATCTATCTCGTTATTATGTACAGCCACAATGGGTATTTGACAGTGTAAATGCAAGGACACTGTTACCCATTAACAAGTACCTGATGGGTGCAGTATTGCCGCCACATCTATCACCATTCATTGATAAGTCGAAGGATCAAGTATATATGCCACCTGAGCAACGAGCTCTTAATGATTCCAACTTTAAACCTCTTGATGATGAACCATCAGACGAAGAAATTGAAGAAGCCAGCGATGAAGAGAAAGAGGAACAAAGTGAACCAGAGGATGGCGAGGAGGCTCTAGCTCGTCAATACAAGCAAGAGATGGAACAAGATTCACCATCGGAAGATGATGACGACAATGACCAGGACCCTGACAAGAAGAAGGCTGCCAAGGAAAAGAAAAAGGCCATGGCAGTTACAACTGGTGTACCCTTCAAAGAACACCCATACAAGAAAGAGATTGAAGACAAACAAGCATTCAGATTACGAGAGAAACTTGTTCCGAAGAAGCACAGAAACTTGTATAAGAGCATGAAAGCTGGACAGGAGAAAAGAAAGAAGGAAATCTGGCTTTTACGGAAGAAAAGGCGTCTCCATGATGAAAAGGTTTCAGAAGAGAAGAAAGCTGTGAAACGCAAACAAAAAATGCAAGCATTGGAAGCTTCAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGCATCGCTATCTTCTAGTTCTGGCATTGGAAAACGTAGGATTGCTGCTATCCCAGTAA

Protein sequence:

>DPOGS212137-PA
MVSKRKKKYSTGEGAQFMTRKAALKKLQLSLKDFRRICILKGIYPREPRNRKRAQKGAGGIKTLYHTKDIKFLLHEPIIWKLRELKVFQQKIRKARAMREYGKMRKYFSDYPEINIDHIVKERYPTFVDALRDLDDCLTLCFLFSTFPSLKRVPRDQSLLCRRLTVEFMHAIIAAKALRKVFVSVKGYYYQAEIEGQTITWIVPHHFSFKPQNKDEVDFKIMSTFVEFYIMVLGFVNFKLFHSLNLVYPPKLTAGLNSDAEKDLVDEKAYVAERVAAMNMSIARIAGSNEAEELPDIDVFNTEDNDPEKMEEAKKEAEKIKVLKTMFKGLKFFINREVPREPLVFIIRCFGGEVSWDRDHFVGATFDESDETIAYQIVDRPSMDKQYLSRYYVQPQWVFDSVNARTLLPINKYLMGAVLPPHLSPFIDKSKDQVYMPPEQRALNDSNFKPLDDEPSDEEIEEASDEEKEEQSEPEDGEEALARQYKQEMEQDSPSEDDDDNDQDPDKKKAAKEKKKAMAVTTGVPFKEHPYKKEIEDKQAFRLREKLVPKKHRNLYKSMKAGQEKRKKEIWLLRKKRRLHDEKVSEEKKAVKRKQKMQALEASXXXXXXXXXXXXXXXXXXXXXXHRYLLVLALENVGLLLSQ-