Monarch geneset OGS2.0

DPOGS205824
TranscriptDPOGS205824-TA1881 bp
ProteinDPOGS205824-PA626 aa
Genomic positionDPSCF300081 - 472908-476522
RNAseq coverage486x (Rank: top 26%)
Annotation
HeliconiusHMEL0158960.071.58% 
BombyxBGIBMGA009312-TA1e-2946.67% 
Drosophilapigs-PA1e-3732.84% 
EBI UniRef50UniRef50_D2A0B67e-9656.51%Putative uncharacterized protein GLEAN_07302 n=3 Tax=Tribolium castaneum RepID=D2A0B6_TRICA
NCBI RefSeqXP_975598.12e-9656.51%PREDICTED: similar to growth arrest-specific 2 [Tribolium castaneum]
NCBI nr blastpgi|2700052743e-9556.51%hypothetical protein TcasGA2_TC007302 [Tribolium castaneum]
NCBI nr blastxgi|3214000800.062.31%growth arrest-specific protein 2 [Bombyx mori]
Group
Gene OntologyGO:00055153e-23protein binding
GO:00070501.6e-22cell cycle arrest
KEGG pathway 
InterPro domain[34-161] IPR0017153e-23Calponin homology domain
[233-315] IPR0031081.6e-22Growth-arrest-specific protein 2 domain
Orthology groupMCL16991 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205824-TA
ATGGAGCACACGTCAGACGAGCGTGACGTGACTAACATGACGGTTAGCATATTGAGCATACGTCCAAGTCGTCTGTCTGTCAAAGGCATCACCTACCTGAACGGGGAGAACTTCCTGGACGTGTTGGACAACGGCGCGGAACTGTGTCAGCTGGCGGTGGTGATACACGAGCGAGCGAGGGAAGCGCTCCACCAGGGACTCATAGTCGGACCGGTGCCAGCGATCCGCGGGCGCTGCTGGCAGCGAGCAGCTCGTCGCAGTTTCTTCTCACGCGACAACGCGGAGAACTTCATCACCTTCTGCAGAGAGCTCGGTGTACATGAGAATCTGCTGTTCGAGAGCGATGACCTGGTGTTACACAACCAACCGCGACAAGTCATCCTGTGTCTACTAGAGGTGGCCCGGCTCGCGACCAGGTTCAACGTGGAACCTCCGGGGCTGGTACAGCTCGAGAAGGAGATTGCCTTGGAAGAACGCGATTCCGGACTAGACTCCGCTATGTCGGGGGCAGCCTGGCAGTTCAGAGACAGCTCACCCTCGCCTATCAGAGACAAGTCTAAGAAAGAAGACGCTGTTTTATCGCTAGAACCAAATGATTCTCGTCCTTCGCTGCAGGACAACACCGACGCAGAGAGCACGAGATCCGTGGACACCGAGTCTAGCTCTGATACCGACGTTCCGCTAAGACCGACCAACGAACTGGACAAGAGGGTACAACTCGTTACGCGTCTGATGGAGCGAGGATGCAACTGCGGCTCAGGAAAATGTTCCAAGCTCCTGAAGGTGAAAAAAGTCGGCGAAGGGAGATACAACATCGCTGGAAGGAACGTCTTTATTAGATTACTTAAAGGACGCCACATGATGGTACGTGTGGGCGGGGGGTGGGACACCCTCGAGCACTTCCTCTCGCGTCATGAGCCTTGTCAGGTGCGCCTCGTGACGCAGGGCCGGCGTGACGTGCTTCCTGTGCCGCTGCCCCCCTCCCCTTCTCCGGCGCATTCACACTCACCTAGTCCCTCTCCTCTCCCAGCCTCCGCCGGGCCTTCTCCAGCCCACGCTGGCCCATCACCTGCACCCAGCAGCACCTCTCTTGCCGGCACCGTCTCTCCCATCGACAGCAAGGCATCATCTATTAGCGGCAAAACCTCTCCCGTTGATCCTAAGCTCGATAAAACTCGAACTCCTGGCAAGCCCGATAGCGGAAAGTGCTCCCCGCTCCGCCGCACTTCTACACCCAATAAACCAGTTTCATCCAGGAACAGATCTGCCTTGACACTGCCCCTCAAAGATTCGACCGAAAAGAAGTCCCCAGCACCCCGAGCTAGAAAGCAATCCGCACCAGCTAGCTTCAGCACTCCGAACACGCCGACCGGAAGGTCCATCCGGACTCCTCCCGTCGAACTTCGCAAGTCGCTCACTTCAAACACATTAATAAACGCACCTAAAAAATCGAGAAGCATGAGTCTAGCGAGCGCCCCCAACAATGAAAGGAAATCATTTTGTGGGACGGATAAACTGAATCAGTCCAAGAAGACCCGCAGCATGAGCGGCGCCAGTACCCCCAGTGAGTCGTCGGTGGCGCGCCCCACTCGCAGCCAGAGCCTCGCGTCTACCCCCGTCAATGGCCCCAAGAAACCTTTGGCCAATAGCTTCGCTAAGAAGACTCGGAGCATGAGCCTGGCGACGCCCATAGATTTCTCGAAACCGCTCACAAAGAGCGCTTCCATTGGTGCGACCGGACTCACTCAGGCCGCCATCGAAGAGAGCATCCGCTTGTCACTGGAGGCCAGCATAGCCGACAATACCTCCCCGAAAAAGCCATTTTTGCATATCAAAGCCAAATATCGTAGTCCGCCGCCGCGGGAGGTTCCCCCGAGATAA

Protein sequence:

>DPOGS205824-PA
MEHTSDERDVTNMTVSILSIRPSRLSVKGITYLNGENFLDVLDNGAELCQLAVVIHERAREALHQGLIVGPVPAIRGRCWQRAARRSFFSRDNAENFITFCRELGVHENLLFESDDLVLHNQPRQVILCLLEVARLATRFNVEPPGLVQLEKEIALEERDSGLDSAMSGAAWQFRDSSPSPIRDKSKKEDAVLSLEPNDSRPSLQDNTDAESTRSVDTESSSDTDVPLRPTNELDKRVQLVTRLMERGCNCGSGKCSKLLKVKKVGEGRYNIAGRNVFIRLLKGRHMMVRVGGGWDTLEHFLSRHEPCQVRLVTQGRRDVLPVPLPPSPSPAHSHSPSPSPLPASAGPSPAHAGPSPAPSSTSLAGTVSPIDSKASSISGKTSPVDPKLDKTRTPGKPDSGKCSPLRRTSTPNKPVSSRNRSALTLPLKDSTEKKSPAPRARKQSAPASFSTPNTPTGRSIRTPPVELRKSLTSNTLINAPKKSRSMSLASAPNNERKSFCGTDKLNQSKKTRSMSGASTPSESSVARPTRSQSLASTPVNGPKKPLANSFAKKTRSMSLATPIDFSKPLTKSASIGATGLTQAAIEESIRLSLEASIADNTSPKKPFLHIKAKYRSPPPREVPPR-