Monarch geneset OGS2.0

DPOGS202220
TranscriptDPOGS202220-TA1104 bp
ProteinDPOGS202220-PA367 aa
Genomic positionDPSCF300149 + 254586-256329
RNAseq coverage317x (Rank: top 36%)
Annotation
HeliconiusHMEL0091964e-9754.97% 
BombyxBGIBMGA013495-TA2e-9052.21% 
DrosophilaCG17249-PA3e-2132.38% 
EBI UniRef50UniRef50_E2AUF36e-3233.42%Proline-rich protein PRCC n=3 Tax=Formicidae RepID=E2AUF3_CAMFO
NCBI RefSeqXP_002012120.11e-3030.34%GI16797 [Drosophila mojavensis]
NCBI nr blastpgi|3071707962e-3133.42%Proline-rich protein PRCC [Camponotus floridanus]
NCBI nr blastxgi|3071707961e-3631.12%Proline-rich protein PRCC [Camponotus floridanus]
Group
KEGG pathway 
InterPro domain[327-367] IPR0188001.1e-20Mitotic checkpoint protein PRCC, C-terminal
Orthology groupMCL18571 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202220-TA
ATGGCTTTAGTTGCTTACGAAAATAGTGACTCTAGTGAATTTGAGGACGACGCGGAAGAAAATAACATACCGGTGAAACAGCTCAAAAGAGAAGATGATTCTGCCACTAAGCCCGATTCAGCTGTACCTGAAAGTAGTACAAACCTGTTTAATCAACTTCCAAAACCAGTTAAAGACACTAACAGTGTAATAGAGGAAGATGATGAATTTTTACATAAAAAGGATCCCCAGCCCGACATAGTGAAACCTAAAACAAGGATAACAATACCTCTATTGAGTGATTTCAAAGATGTTCAAAATACTGTACCAACTTCCAAAACAAAATGTATGGGTGAGAAAAAGTCTGGACTACTCGGTATACTGCCACAACCGAAAAACAAATTTACGAGCACAACAAAATCACTAATTCCTAACGTTGTTTCACAAAACACAAAAACAAATAATATTAACAAGAAACAACTTCCATCACCAGTAAAGTTAAAGACGGAGGCAAAATCCTGTCTGGTGAATGAGTATTCCGATGAAAGCGACAACGATGAAGATGTACAAAATGATTTCTTTTCTATTCATAAATGTGTGGAGTTGCCTCCGGATGCACCATTGGATATTGATGTTAAGCCAGTCGAGCGACCATCCACTAAGGAACCGAGGAGTTTGGAGTCCTACTTTAAGAAGGAGACACATGTGGAATTACAACCAGACATAGATCTTGATAATCCTATGAACTATGTCGATGATACAATTCCGGAGCCAATTCAAAGCATAGAAAACAATACAAACGAGATCCTTAATGAGGAGGCTATATTAAAACTGACCGGCGCTCGAGGAAAGAGAAAACGAGAGGACATACAGATAGTAGATATCAATCAACAAGAAATATTAGATGAAGCGAGAGCGATGTTGATGGAGGGTCTGATGAAGGACACGAGTAAGATACAATCATCAAGTAGGAAAACTGGCAACGGACCAACCAGCCAGCAGAAGAGGAAACATCAAATCACTTATTTGGCGCACCAGGCTAAAGCCAATGAACAAGAGCTTCAAAACCAATGGGCCAACAACAGGATGTCAAGGAGACAGACACAGTCTAAATATGGATTTTAG

Protein sequence:

>DPOGS202220-PA
MALVAYENSDSSEFEDDAEENNIPVKQLKREDDSATKPDSAVPESSTNLFNQLPKPVKDTNSVIEEDDEFLHKKDPQPDIVKPKTRITIPLLSDFKDVQNTVPTSKTKCMGEKKSGLLGILPQPKNKFTSTTKSLIPNVVSQNTKTNNINKKQLPSPVKLKTEAKSCLVNEYSDESDNDEDVQNDFFSIHKCVELPPDAPLDIDVKPVERPSTKEPRSLESYFKKETHVELQPDIDLDNPMNYVDDTIPEPIQSIENNTNEILNEEAILKLTGARGKRKREDIQIVDINQQEILDEARAMLMEGLMKDTSKIQSSSRKTGNGPTSQQKRKHQITYLAHQAKANEQELQNQWANNRMSRRQTQSKYGF-