Monarch geneset OGS2.0

DPOGS200807
Transcript: DPOGS200807-TA, 1554 bp
Protein: DPOGS200807-PA, 517 aa
Genomic position: DPSCF300249 - 37529-39243
RNAseq coverage: 1x (Rank: top 93%)
Annotation
Heliconius: HMEL008138, 3e-30, 51.03%
Bombyx: BGIBMGA010196-TA, 2e-11, 29.34%
Drosophila: no hit
EBI UniRef50: UniRef50_UPI0001923F05, 1e-136, 47.50%, UPI0001923F05 related cluster n=7 Tax=unknown RepID=UPI0001923F05
NCBI RefSeq: XP_002156794.1, 2e-166, 52.14%, PREDICTED: similar to zinc finger, MYM domain containing 1, partial [Hydra magnipapillata]
NCBI nr blastp: gi|221119885, 5e-165, 52.14%, PREDICTED: similar to zinc finger, MYM domain containing 1, partial [Hydra magnipapillata]
NCBI nr blastx: gi|221119885, 8e-160, 52.40%, PREDICTED: similar to zinc finger, MYM domain containing 1, partial [Hydra magnipapillata]
Group
Gene Ontology: GO:0046983, 1.2e-11, protein dimerization activity
KEGG pathway: hmg:100205939, 2e-07
    K07190 (PHKA_B) maps ->
        Insulin signaling pathway
        Calcium signaling pathway
InterPro domain: [424-493] IPR008906, 1.2e-11, HAT dimerisation
Orthology group: MCL15477, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200807-TA
ATGGCGTTTAGAGGAAGCTCAGACAAACTGTACACACGAAACAATGGCAAGTTTTTGGGATTAGTCGAATTATTGGGTAAATTCGATCCAATTATGGAGGAGCATTTGAAGCTGGCAACAACAGGTGGCATTTCTGATCACTACTGCGGTAAGGATATTCAAAACGAGTTAATTCATCTGATGGGCGAAAAAGTTACTTCCGAGATTATTTCACGAGACAAAAACGCTAAATATTATTCAATTATAGCGGACTGTACTCCAGACATAAGTCATGTCGAAGAATTGTCCCTGACTATTCGATTTGCAGATTTGACAGACGCTAACGTCGCTGTTAATGAGCATTTTATAGAGTTCATCCCAACCACTAGCTCTACAGGTGCAGGACTTACCGAAGTAATTTTGAGCATTCTCAAGAAACACGGACTCGAGATAGCGAATTGCAGATGTCAGGGGTACGACAATGGAGCTAACATGAAGGGAAAAAATATTGTTAATCGATGGAAAATACTCACTGATCATGTAGAACTCTACAGTGTGAAAAAGCTGAGTGATACGCGTTGGGAAGCCAAAATAAACAGCGTAAAATCGGTTCGCTATCAAATTTGTGAAATTCATGATGCTCTAGTTACATTGGCTAACGTTACGGAGAAAACCGATCACATCACCTCACACGAGGCAAGCACATTAGCAGAACAACTGAAAGATTTCGGTTTCATTCTTTCTTTGGTGGTTTGGTACGAAATTCTGTTTCAAATTAATGTTGTTAGCAAATCACTGCAATCTCAAGACCTGGATCTCGGTAAATCCGCAGAAATGCTCGAAAATTGTTGTAATTTCTTTGAAGAATACCGAAAACCTGGATTCAAGAAAGCTTATTGCACTGCAAGTGACTTAGCTAAAGAGCTTCAAGTTAATCCAGAATTCAAACCTGCAAAACGTTTACGACGTATAAAACGTCAGGCTGGTGAAATAGCTACGGATGAGCCAATTGAATCTCCTGAGAAAAGATTCGAAGTCGAATTTTTCAACAAGTTGCTGGACGTCGCTTTGATGTCAATAAAAGAGAGATTCCAACAATTAAAAGACTATTCGGACACGTGGTCTTTTTTACACGATATTAAAAAAAATCCGGAAAAAAAGAAACTTGCAGTATTATGTGCAAATCTCCAGCTTACTGTTGGTTCCAATTCCGATATCGATGGAGATAGACTATGTGACGAGCTCATAAGTCTGAAGCATTTTCTTCCCGGTGACAATATTTCGTGTATAACCGTACTCAATTTCATACGACAGCGCGAAATTCAGGAACTGTATCCAAATGTTTGGATTGCTTTCCGGATTTTTGCAACCATTCCGGTTACTGTTGCAAGTGGAGAACGCAGTTTTTCAAAACTAAAACTAATAAAAACGTATATTCGATCAACAACCTCTCAATCGAGACTCTCTAACCTAGCTACTCTGTCGATTGAAAATGAAATCGCTGGACAATTAGATTTTTCTCAATTAATTCGATCATTTGCTGACAAAAAAGCCCGGAAGGTGAAATTTTACTGA

Protein sequence:

>DPOGS200807-PA
MAFRGSSDKLYTRNNGKFLGLVELLGKFDPIMEEHLKLATTGGISDHYCGKDIQNELIHLMGEKVTSEIISRDKNAKYYSIIADCTPDISHVEELSLTIRFADLTDANVAVNEHFIEFIPTTSSTGAGLTEVILSILKKHGLEIANCRCQGYDNGANMKGKNIVNRWKILTDHVELYSVKKLSDTRWEAKINSVKSVRYQICEIHDALVTLANVTEKTDHITSHEASTLAEQLKDFGFILSLVVWYEILFQINVVSKSLQSQDLDLGKSAEMLENCCNFFEEYRKPGFKKAYCTASDLAKELQVNPEFKPAKRLRRIKRQAGEIATDEPIESPEKRFEVEFFNKLLDVALMSIKERFQQLKDYSDTWSFLHDIKKNPEKKKLAVLCANLQLTVGSNSDIDGDRLCDELISLKHFLPGDNISCITVLNFIRQREIQELYPNVWIAFRIFATIPVTVASGERSFSKLKLIKTYIRSTTSQSRLSNLATLSIENEIAGQLDFSQLIRSFADKKARKVKFY-
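
Consistency check (illustrative sketch, not part of the original record):

The lengths listed above are consistent with each other: 1554 bp = 3 x (517 + 1), that is, 517 codons for amino acids plus one stop codon. The sketch below shows how the nucleotide and protein sequences could be cross-checked. It assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein sequence marks the stop codon. The function name translate is hypothetical and introduced only for this example.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1.

    Stop codons are written as '-' to match the convention that this
    record appears to use at the end of the protein sequence.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = (cds[i:i + 3] for i in range(0, len(cds), 3))
    return "".join(CODON_TABLE[c].replace("*", "-") for c in codons)


# First 30 nt and last 12 nt of DPOGS200807-TA, copied from the record.
print(translate("ATGGCGTTTAGAGGAAGCTCAGACAAACTG"))
print(translate("AAATTTTACTGA"))
```

Passing the full DPOGS200807-TA sequence to translate and comparing the result with DPOGS200807-PA would extend the same check to the whole record.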