Monarch geneset OGS2.0

DPOGS207130
Transcript        DPOGS207130-TA  2235 bp
Protein           DPOGS207130-PA  744 aa
Genomic position  DPSCF300001 + 3545947-3556855
RNAseq coverage   383x (Rank: top 31%)
Annotation
Heliconius      HMEL011686        0.0     88.38%
Bombyx          BGIBMGA013089-TA  0.0     81.59%
Drosophila      pigs-PA           5e-89   53.67%
EBI UniRef50    UniRef50_Q7QJZ5   3e-94   45.49%  AGAP003901-PA n=1 Tax=Anopheles gambiae RepID=Q7QJZ5_ANOGA
NCBI RefSeq     XP_001355248.2    4e-98   33.52%  GA17814 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastp  gi|347970881      1e-93   45.49%  AGAP003901-PA [Anopheles gambiae str. PEST]
NCBI nr blastx  gi|350407389      3e-107  35.77%  PREDICTED: hypothetical protein LOC100740651 [Bombus impatiens]
Group
Gene Ontology  GO:0007050  1.1e-46  cell cycle arrest
               GO:0005515  3.7e-14  protein binding
KEGG pathway 
InterPro domain  [182-254] IPR003108  1.1e-46  Growth-arrest-specific protein 2 domain
                 [71-152] IPR001715   3.7e-14  Calponin homology domain
Orthology group  MCL16858  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207130-TA
ATGACATTAAAGCACCACGTGACCTGGAAATCATCAGAGCTCTGCCATATAAGGGCTGGTCCGCGCCGCCCCGTGCCAGTCGAGGCCGAGTGCTCGACGTGGAAGCACGCTAACGCCGTACGCGAATCAGCTAGACTCATCCTTGACGCACCTGCTCCTGAATCTGATGATGCCTTGACCCTAGCGAAGGCGCTGCGTTCTAGACCACCAGTCAACATGCTCCCTGCTGCCAAAGCAGGAACGTTTTTTGCAAGAGATAATTTATCAAACTTTATTGACTGGTGTCGCAGAGCACTCGGTATTTTGGAGTGCCTTTTATTCGAAACCGACGATCTATGTCTTCGTAAGAACGAAAAGCACGTCGTTTTATGTCTTTTAGAAGTAGCCAGGAAGGGAGCCGTTCTTGGAATGCCAGCTCCTCTCCTCGTACAAATGGAGAAGCAAATCGAAAGAGAATTGGCAGGGGAAGAATTAAGACCGGATGATTCAGCTCTAGGCCTTGTACCCTCCGGCCCTCAACCACAGCTGGTGACAAATGACTTGAGGAGTCTTGACGAGAGAGTAAGAGATTTGGTTGAGAGGTGCTCCTGCCCCACTCAGTTTCCTATGGTGAGAGTCTCCGAGGGCAAATACAGAATTGGAGATACGAGGCTTCTGATTTTCGTCAGGATTTTACGCTCCCATGTCATGGTTCGCGTTGGTGGTGGCTGGGATACACTGGCCCACTATCTGGACAAACATGACCCTTGCCGCTGCCGTGCTCAGCATAGAACATCCCTCTCAGCCCGCTTGGCTAGACCCAAGCAGGATCTTGCTGGAGCTACTGTTACATACGAACGTTCAGACCCTAGCCAAACATCCCAGCCATGCAAAGATTATAAAGAAACAAAGAATTATGAGCCAATGTCGCTCCAATACTCTTCCAAATTATACAGTGACGATCCTAGAGCTGGTCATTACCAATCCAACTGCAAACTGGACGCTCCTACAAGTTTACCATATTTAGGAGCTATGGATCGTGCTGCCTCCCCAAATAGAAAGATTCTGCAAAGAAACAATAGCCCTGGCCGGCATTCCTCCTCTCCAGATCGCAGGACTAAGATCGTGACCACAAACCATTTGGTGCCTACGCCTCATGGCGTAAGAAATAAAAGTCCGCGTCCCCTGTCTCCTGCTCCGGCAGCGGAAAGCGCTTCAGACAACGGGTCCGAGGTCTCTGACGAAGGTTACAGAAGCCTGGGAGTGGTCGCCGGCTCTACTCAAGGGTCGCCATCAAACAAAACAGCAAATCGATACTCACTGCACAGTCAGAATTCCATGGATGACGCTGATTTTAGTGAGCGTCTCGACGAGGACGGCTGTCAAATAGATAAAAACGAACGAGTTGATGACTACGTCAGCCTCAATACTGGATTGCGCAAGACAGATTTCTCTGACACATTCTACGGCAGCAGGAAGAACAGCTCAGAAGACAAATCCAACAGAGCCAGTCCTGAATGTATAGTAGTACACGAAAACAACGACTCACCGAGCAAGAGCCTAAGACCGACACGAGAATCAACACAATCACCAGCGAAGGCAATCAGAAATAGACCTGCTAGCAGGATACCACACTCACCAGTTAGGAACAGAACACCAAGCAGAGGCAACACTCCCAGTCCCAAGCATATCGCTAATCCACAATCATCACCAAAATTGGCACCAAAATTACCACCTACCAGCCGGAACACATGGGGAGGAAGATCAGCTCCAAATCAAGCCAAAGCCAAAACACGGCCAACTGTCGGAGCGGATACTTTTGAAAATCCCAACAAATCTCCAAAAGCGAAGCCAAAAGCTCCGCAAAATGAGGCCTTTAAACGGAACTCGCCTCTTCGGGCGAGCAGTGCAACTTTAAGGTCTCCCACTCACCAAAAAGCTTTAAGCCCTCTGCTCGAACAGATCTTGAGATCTGCAGAATCAGCTAAAGATGACGCCTCCGTTTTGGAGAAGATGAAGGA
AATAATTAGGTCTTATTCTAAAGGCGAGGACTCGATATCAAGAACGAGTTCAAAAGATTCTGACTACGCAGATTTCACGTCGGCTTGGGTGATGTCTGATGGAAAGTTAGAGAGATCTACAAGCACTAGGCAATTAGCTGCACCCAGGAAGGATCCAAGGACCGGTGCATCCAGGATACCGGCTCCTGTGTCGCTGGGCTGCCGACGGTCAACCTCTACCTCGCAGTTCCCTTGA

Protein sequence:

>DPOGS207130-PA
MTLKHHVTWKSSELCHIRAGPRRPVPVEAECSTWKHANAVRESARLILDAPAPESDDALTLAKALRSRPPVNMLPAAKAGTFFARDNLSNFIDWCRRALGILECLLFETDDLCLRKNEKHVVLCLLEVARKGAVLGMPAPLLVQMEKQIERELAGEELRPDDSALGLVPSGPQPQLVTNDLRSLDERVRDLVERCSCPTQFPMVRVSEGKYRIGDTRLLIFVRILRSHVMVRVGGGWDTLAHYLDKHDPCRCRAQHRTSLSARLARPKQDLAGATVTYERSDPSQTSQPCKDYKETKNYEPMSLQYSSKLYSDDPRAGHYQSNCKLDAPTSLPYLGAMDRAASPNRKILQRNNSPGRHSSSPDRRTKIVTTNHLVPTPHGVRNKSPRPLSPAPAAESASDNGSEVSDEGYRSLGVVAGSTQGSPSNKTANRYSLHSQNSMDDADFSERLDEDGCQIDKNERVDDYVSLNTGLRKTDFSDTFYGSRKNSSEDKSNRASPECIVVHENNDSPSKSLRPTRESTQSPAKAIRNRPASRIPHSPVRNRTPSRGNTPSPKHIANPQSSPKLAPKLPPTSRNTWGGRSAPNQAKAKTRPTVGADTFENPNKSPKAKPKAPQNEAFKRNSPLRASSATLRSPTHQKALSPLLEQILRSAESAKDDASVLEKMKEIIRSYSKGEDSISRTSSKDSDYADFTSAWVMSDGKLERSTSTRQLAAPRKDPRTGASRIPAPVSLGCRRSTSTSQFP-
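
Consistency check (illustrative sketch):

The record lists a 2235 bp transcript and a 744 aa protein. These agree if the transcript is a pure coding sequence with one stop codon: (744 + 1) x 3 = 2235. The Python sketch below shows one way to check this and to translate codons with the standard genetic code. The function names `expected_cds_length` and `translate` are hypothetical helpers written for this illustration and are not part of any database tooling. Using "-" for the stop codon is an assumption based on the protein sequence above ending in "-".

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G and the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def expected_cds_length(protein_length_aa: int, include_stop: bool = True) -> int:
    """Return the coding sequence length in bp for a protein of the given length."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        # Assumption: the stop codon is written as "-" to match the record.
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    # Lengths copied from the record: 744 aa protein, 2235 bp transcript.
    print(expected_cds_length(744) == 2235)
    # First ten codons and last four codons of DPOGS207130-TA.
    print(translate("ATGACATTAAAGCACCACGTGACCTGGAAA"))
    print(translate("CAGTTCCCTTGA"))
```

Passing the full DPOGS207130-TA sequence (with line breaks removed) to `translate` gives a string that can be compared directly with the DPOGS207130-PA sequence.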