Monarch geneset OGS2.0

DPOGS206401
TranscriptDPOGS206401-TA1362 bp
ProteinDPOGS206401-PA453 aa
Genomic positionDPSCF300192 + 268456-270837
RNAseq coverage222x (Rank: top 45%)
Annotation
HeliconiusHMEL0090260.080.79% 
BombyxBGIBMGA005799-TA7e-17979.18% 
DrosophilaCG3862-PA1e-11345.43% 
EBI UniRef50UniRef50_E2A0U74e-13053.47%Williams-Beuren syndrome chromosomal region 16 protein-like protein n=7 Tax=Endopterygota RepID=E2A0U7_CAMFO
NCBI RefSeqXP_395025.25e-12850.99%PREDICTED: similar to CG3862-PA [Apis mellifera]
NCBI nr blastpgi|2620918100.081.01%CG3862-PA-like protein [Plutella xylostella]
NCBI nr blastxgi|2620918100.081.20%CG3862-PA-like protein [Plutella xylostella]
Group
KEGG pathwaydre:5575138e-25 
 K10615 (HERC4)maps-> Ubiquitin mediated proteolysis
InterPro domain[46-450] IPR0090915.8e-76Regulator of chromosome condensation/beta-lactamase-inhibitor protein II
[401-448] IPR0004084.9e-11Regulator of chromosome condensation, RCC1
Orthology groupMCL13579 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206401-TA
ATGAACGCTATAAAATTATTTGTACAGCGGAGAAGTTTTCCATTGATTTTTAATCGCACAGCTACAACTAAAAAGAAAATACACGATCCCAGTGAAGAAGATCGTTTACCGATATTTCAATATCCTATCAGCAAGAGTTCAGATCGAAGAGTATATGTATGGGGCTTAGCGGAAACTGGCGCTCTTGGAATACATATGCCTAGAGGAAAGAAAAAAGGCAAAAAAGCTTATAGGAATAATTTTAGTTATGCCTGGCACCCTATAAGATCCAGTTTTTGTGAAAGGTTTGATGTAACAAATATAGCCTGTGGCTATGGTTTCACGGTTGCTGCGGTCAAAACCAAAGAACAGCACAAGGTTTTTGGTACAGGTATTAACACCGACTCTCAGATTGGTTACCATGCACCTAGAATGGGTCATCCTCTTGAAATGCTTGTCAGCCCTGCCCCTATTTTTATTCCGTATGCTTCTTTAGAAACTAAGATAACTGGTCTTGCAGCTGGCAGAGCACACACAGTCATACTAACAGACAATGAAGGAGTGTTTACATTAGGGAATAATGCTTACGGGCAATGTGGTAGGAAGATAAATCCTCAAGAAGAATACAAAGGAAGTATGGTGTCACATAATATTAAACATCTTGGGAAGGAAAATATCAAGAGTGTTTGTTGTGGACAAGATCACAGCCTTTTCATCACTGAATCAGGGAAAGTATATGCTTGTGGGTGGGGAGCAGATGGACAGACGGGTCTTGGGATATATGAAAATCAGGGATTTCCGGCTCGTGTTAAAGGAGACATTACTAGTGAAAATATAGTTAAAGTTGCGTCAACCGCTGATTGTGTGCTCGCGTTGAGTGACAGAGGTGAATTGTTTGGTTGGGGTAATTCAGAATATGGCCAGGTACCCATGAACACGAAACAGCAACAGGTCAACATGTCATACGCCTTGTTAAATTTTACAAAAGGTTTAGGGAAAATTGTTGACATCGCAGCTGGAGGGTCATTTTGTTTGATCTGTAATGATCAAGGTGATGTATTTGTTTGGGGCTTTGGATTGCTGGGCTTAGGGCCGAATGTACAACACACAAATAAACCGACACAAATACCGGCTCCATTGTTTGGCAGGAATGAATTCAATCCTGAATGTATGGTCACAAAAGTTGCTTGTGGGATAGGTCACTTAGCAGCGATAACTAACAGCGGGGATCTTTATGTTTGGGGCAGAAATAGATATGGTTGTTTAGGGCTCGGCCATCAGAATGACCAACATTTTCCATTAAAAATTGGTATCGGAGCTCATGTGTTGTCGGTGAACTGTAGTGTAGATCACACTGTGGCTGTATGCAAACCATTTACATAG

Protein sequence:

>DPOGS206401-PA
MNAIKLFVQRRSFPLIFNRTATTKKKIHDPSEEDRLPIFQYPISKSSDRRVYVWGLAETGALGIHMPRGKKKGKKAYRNNFSYAWHPIRSSFCERFDVTNIACGYGFTVAAVKTKEQHKVFGTGINTDSQIGYHAPRMGHPLEMLVSPAPIFIPYASLETKITGLAAGRAHTVILTDNEGVFTLGNNAYGQCGRKINPQEEYKGSMVSHNIKHLGKENIKSVCCGQDHSLFITESGKVYACGWGADGQTGLGIYENQGFPARVKGDITSENIVKVASTADCVLALSDRGELFGWGNSEYGQVPMNTKQQQVNMSYALLNFTKGLGKIVDIAAGGSFCLICNDQGDVFVWGFGLLGLGPNVQHTNKPTQIPAPLFGRNEFNPECMVTKVACGIGHLAAITNSGDLYVWGRNRYGCLGLGHQNDQHFPLKIGIGAHVLSVNCSVDHTVAVCKPFT-