Monarch geneset OGS2.0

DPOGS203371
TranscriptDPOGS203371-TA1440 bp
ProteinDPOGS203371-PA479 aa
Genomic positionDPSCF300003 + 300710-304592
RNAseq coverage322x (Rank: top 35%)
Annotation
HeliconiusHMEL0034300.089.38% 
BombyxBGIBMGA002105-TA0.077.10% 
DrosophilaCG9135-PA2e-16658.80% 
EBI UniRef50UniRef50_Q9VMI53e-16458.80%CG9135 protein n=23 Tax=Neoptera RepID=Q9VMI5_DROME
NCBI RefSeqXP_308044.25e-17059.87%AGAP002151-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|583763019e-16959.87%AGAP002151-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|583763014e-16559.45%AGAP002151-PA [Anopheles gambiae str. PEST]
Group
KEGG pathwaydya:Dyak_GE211437e-29 
 K10615 (HERC4)maps-> Ubiquitin mediated proteolysis
InterPro domain[111-462] IPR0090911.8e-82Regulator of chromosome condensation/beta-lactamase-inhibitor protein II
[129-178] IPR0004082.8e-11Regulator of chromosome condensation, RCC1
Orthology groupMCL11148 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203371-TA
ATGTCAAACAACGGCAGTCGCAAGCGCTCCGCTCCAGCCTCGGGAAGGGCCTCTAAGGCTCGGAAGCCCCGCAAGCCAGCCTCAGACGAAGACAGCAATGACTCGGTGCTCTCCGACCAGCCACTGGAGCCGCAGCGGGAACCATCCCCGCTACCCGATGAACCAACTATCAAGTTACCTGAGGAATTACTCAAATCCTTCTACAAAACACCCGGAGTACTTTTGATTAGCGGTCTTGTATCATGGGATCTCGTTGCCAAAAGAGACAACAATCCTTCAAAGAAAACCCATCCCAATCTGTATACTTTTCACAAGTTTACCGAACAAAAGTACAGGCTGATAGTGAGTGGATGCAGTGCCGGACATTCAATTCTGATTTCGGCGGAGGGCGAAGCTTATTCATTTGGTCGCAACTCATGCGGTCAACTCGGCTTCGGTGACACCACGACCAGAAACATTCCGGAGCCCATCCCAACCCTCAAGGGGTTCAACATCATTCATGCTGCCGTTGGAAGAAACCATAGCTTATTTGTGACTGATACGGGTACCGTGTACGCGTGTGGGGACAACAAAAGTGCTCAGTGTGGCCTCGGACACACCACGCCCCAGATATTAGTGCCGACACGCGTGCGATACACCGGAGCTCCAATAGTGAAGGTTGGCTGCGGTGCTGAGTTCTCTATGATATTGGACTGTAATGGGGCGCTCCACTCGTTTGGCCTACCCGAATATGGCCAGTTAGGTCACAACACCGACGGTAAATACTTCGTCACATCAACGAAGCTGTCCTATCATTTCGAGATGGTGCCGAAGCACATCGCTTTCTTCTTTGAGAAGTCCAAGGACGGCCACGTGAGCCCCGTCAAGGATGTGGACATAGTTGACTTCTCCTGCGGCAATAACCACACGGTGGCTATAGATTCCAAGAAGCGTGCGTATAGTTGGGGCTTCGGGGGCTTCGGTCGCCTGGGTCACGCCGAGCAGCGCGACGAGAGCGTGCCCCGGCTCATAAAATACTTCGACTCCCAGGCCCGGGGCGTGCGCTCCGTGCACTGCGGCGCCACTTACAGTCTGGCCGTCAACGAGCACGGGGCTCTGTTTATGTTTGGCCAAACGAAACGTACGGGTGAGGCTAATATGTACCCGAAACCAGTGCAAGATTTGACTGGCTGGAATATTCGTAGCGTGGGTACGAGCAACACATCTATAGTCATAGCCGCCGACGACTCGCTGATAGCTTGGGGCGTGTCACCTACTTACGGTGAACTGGGTACTGGTGATATAAACAAGTCGACCGCTCGGCCTAAGGAGGTGACCCGTATGGAGGGTCTCAATATAACTCAGGTGGCTATGGGCTATTCCCACACTCTGCTCCTCAGCGACGACACCTCTGATGAAGTGAAGCAGAAGCTCGCGTCTATGCCAACCTTCAACCCTTAA

Protein sequence:

>DPOGS203371-PA
MSNNGSRKRSAPASGRASKARKPRKPASDEDSNDSVLSDQPLEPQREPSPLPDEPTIKLPEELLKSFYKTPGVLLISGLVSWDLVAKRDNNPSKKTHPNLYTFHKFTEQKYRLIVSGCSAGHSILISAEGEAYSFGRNSCGQLGFGDTTTRNIPEPIPTLKGFNIIHAAVGRNHSLFVTDTGTVYACGDNKSAQCGLGHTTPQILVPTRVRYTGAPIVKVGCGAEFSMILDCNGALHSFGLPEYGQLGHNTDGKYFVTSTKLSYHFEMVPKHIAFFFEKSKDGHVSPVKDVDIVDFSCGNNHTVAIDSKKRAYSWGFGGFGRLGHAEQRDESVPRLIKYFDSQARGVRSVHCGATYSLAVNEHGALFMFGQTKRTGEANMYPKPVQDLTGWNIRSVGTSNTSIVIAADDSLIAWGVSPTYGELGTGDINKSTARPKEVTRMEGLNITQVAMGYSHTLLLSDDTSDEVKQKLASMPTFNP-