Monarch geneset OGS2.0

DPOGS210716
Transcript: DPOGS210716-TA | 1575 bp
Protein: DPOGS210716-PA | 524 aa
Genomic position: DPSCF300013 - 210433-240074
RNAseq coverage: 645x (Rank: top 20%)
Annotation
Heliconius: HMEL007076 | 2e-158 | 82.17%
Bombyx: BGIBMGA001417-TA | 8e-21 | 32.16%
Drosophila: ari-2-PA | 2e-20 | 23.93%
EBI UniRef50: UniRef50_D6X4U9 | 8e-140 | 50.91% | Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6X4U9_TRICA
NCBI RefSeq: XP_002424373.1 | 2e-139 | 50.39% | RING finger protein, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|332017491 | 3e-139 | 50.19% | E3 ubiquitin-protein ligase RNF19A [Acromyrmex echinatior]
NCBI nr blastx: gi|307180807 | 1e-141 | 49.91% | E3 ubiquitin-protein ligase RNF19A [Camponotus floridanus]
Group
Gene Ontology: GO:0008270 | 1.3e-21 | zinc ion binding
KEGG pathway:
InterPro domain: [144-209] | IPR002867 | 1.3e-21 | Zinc finger, C6HC-type
                 [70-147] | IPR013083 | 4.3e-08 | Zinc finger, RING/FYVE/PHD-type
Orthology group: MCL12067 | Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210716-TA
ATGGTTGAACGTGAAGGAGTGCGCCGCGGAGGCTGCGGCTGGGGTGGCGGAGTCCTCGGTCGTTGGTCGCTGCGCCGTCTGTTGCTAGATAGCCCTTTAGTGGGCAGAAGACTTGCTCATTCAGTATTGGAGCAAGGGGAGCGAACTGTCAGAGTGGGCACGGAGCAGGCTCAGGCCAGACCTGCTGTCCAGTACATCGCTGAAGATGAAGAGGGCGAAGTTCTTGACTGTCCGCTGTGCCTGTCAACGTTTCCCTCGGCCAGTATGGTGAGGCTGGCATGGTGCTCCCACAGGTGCTGCGAGCCCTGTCTGCGGCAGTACCTCGGCATAGAGATATACGAGGCCAGGGTCCCAGTCAACTGTCCCGTCTGCCACGAGAATATGCACCCATCAGATGTTCGTCGGATAGTGGACAATCCGACGTTATATGATAAGTATGAGGAGTTTTCGGTGAGGAGAGCACTGGCTGCCGATCCTGATACCAGGTGGTGTCCAGCCCCTGACTGTAGTTTCGCGGTGGTAGCCAGCGGCTGCGCTTCCTGTCCCAAACTGACGTGCCTGTCGCCGGGCTGCGGAGCCTCGTTCTGTTACCACTGTAAGGCCGCCTGGCACCCCACACAGACCTGCGACGCGGCGAGGAGGACCCGCCAGCAGCCGCCACGTCAGCCGCAGCCTTCTTTGGACGAGCACGGCGAGGTGAAGCCCTGCCCGAGGTGTGCCGTCCTCATAGTGAAGGTGGAGGATGGTTCCTGTAACCACATGGTGTGCTGCGTCTGTGGGGCCGAGTTCTGTTGGCTGTGTATGAAGGAGGTGTCCGACCTGCATTACCTCAGCCCCAGCGGTTGTACGTTCTGGGGCAAGAAGCCGTGGTCGAGGAAGAAGAAGCTGTTGTGGCAGCTGGGGACCCTGGCGGGGGCTCCGCTGGGTATAGCGGTGGTGGCGGGCGTAGCTCTGCCTGCTCTACTGGTGGGTCTGCCGCTGTGGGCTGGGCGGAGGGCCCACAGGAGACTCAGGAAGGCCAGGCCCGCGAGACGCAGGGCCGCAGTCGCCGCCGCCGTCGCTGCTGCGCTGATAGTCTCTCCCCTGGTGGCGGGGCTGGCGGTCGGGATTGGAGTCCCCATCCTGTTGTTCTACGTGTACGGCGTGGTGCCGGTTTCCCTGTGCCGCGCGGGGGGCTGCGCCGGCGGGGGCGAAGCGCCCGAGCGGCCCGAGCAAGCCATCGACCTGGACGCCACCTCCAGGAGATTGGAGCCCAGCATCGGTGACGCGTCTCTGTCAGCGGGCTCGGCACAGGCCGAGGCTCGGGCGGCGTCTTTGGCGGGGCTGGCGGGGTCGCTGGGGGGTCAGGACGCCGCCTCGCCCGCCGCGCACCGCCTCGAGGTCACCGTGGACGTGTTCAGAGCCTTCGCCTCGGACACCGCGAGCGCCGCCAGCTGGCCGGACGACCTGCCCTCGACCTCCATCAGCTCGGCGGCGCGGCTCCGCAACCTGTTCCTGTACGGAGCGCCCCAGCCCCCTCCGGACCCCGAGGCGGGCGGGGTCGAGCCGCGCCCGGACACTCCACCGACATCCTAA

Protein sequence:

>DPOGS210716-PA
MVEREGVRRGGCGWGGGVLGRWSLRRLLLDSPLVGRRLAHSVLEQGERTVRVGTEQAQARPAVQYIAEDEEGEVLDCPLCLSTFPSASMVRLAWCSHRCCEPCLRQYLGIEIYEARVPVNCPVCHENMHPSDVRRIVDNPTLYDKYEEFSVRRALAADPDTRWCPAPDCSFAVVASGCASCPKLTCLSPGCGASFCYHCKAAWHPTQTCDAARRTRQQPPRQPQPSLDEHGEVKPCPRCAVLIVKVEDGSCNHMVCCVCGAEFCWLCMKEVSDLHYLSPSGCTFWGKKPWSRKKKLLWQLGTLAGAPLGIAVVAGVALPALLVGLPLWAGRRAHRRLRKARPARRRAAVAAAVAAALIVSPLVAGLAVGIGVPILLFYVYGVVPVSLCRAGGCAGGGEAPERPEQAIDLDATSRRLEPSIGDASLSAGSAQAEARAASLAGLAGSLGGQDAASPAAHRLEVTVDVFRAFASDTASAASWPDDLPSTSISSAARLRNLFLYGAPQPPPDPEAGGVEPRPDTPPTS-
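
The transcript and protein lengths reported in this record can be cross-checked with simple arithmetic. The sketch below assumes the transcript is coding sequence only, running from the ATG start codon through the terminal TAA stop codon, as the nucleotide sequence in this record suggests. The function name and its parameter are hypothetical and are not part of any database tool.

```python
def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of cds_bp bases.

    Assumes the sequence is coding-only (no UTRs), which is an assumption
    made here for illustration.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    # The stop codon is counted in the nucleotide length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: 1575 bp transcript, 524 aa protein.
print(expected_protein_length(1575))  # 524
```

Under that assumption, 1575 bp gives 525 codons, one of which is the stop codon (shown as the trailing "-" in the protein sequence), leaving 524 residues, which matches the reported protein length.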