Monarch geneset OGS2.0

DPOGS205421
Transcript: DPOGS205421-TA | 1758 bp
Protein: DPOGS205421-PA | 585 aa
Genomic position: DPSCF300407 + 339914-346212
RNAseq coverage: 1030x (Rank: top 12%)
Annotation
Heliconius: HMEL022324 | 0.0 | 85.15%
Bombyx: BGIBMGA001590-TA | 0.0 | 74.87%
Drosophila: trbd-PA | 4e-27 | 33.33%
EBI UniRef50: UniRef50_UPI00021A88B4 | 4e-95 | 40.15% | UPI00021A88B4 related cluster n=3 Tax=unknown RepID=UPI00021A88B4
NCBI RefSeq: XP_397517.2 | 5e-97 | 40.33% | PREDICTED: similar to zinc finger, A20 domain containing 1 [Apis mellifera]
NCBI nr blastp: gi|380026275 | 3e-96 | 40.51% | PREDICTED: OTU domain-containing protein 7B-like [Apis florea]
NCBI nr blastx: gi|380026275 | 4e-93 | 40.04% | PREDICTED: OTU domain-containing protein 7B-like [Apis florea]
Group
Gene Ontology: GO:0003677 | 6.1e-06 | DNA binding
    GO:0008270 | 6.1e-06 | zinc ion binding
KEGG pathway 
InterPro domain: [153-314] IPR003323 | 2.9e-18 | Ovarian tumour, otubain
    [559-583] IPR002653 | 6.1e-06 | Zinc finger, A20-type
Orthology group: MCL14609 | Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205421-TA
ATGGTTATGGAAGCGAATCCCATTAATGAAAAACGAACAGATGATCGCGTCCACGCTATACGCACGCGAGCTGGTGGGGATGTGATTCCAGAACGTTCCGTGGGAACCTTGGACCTAAATATGTCATTCACACCATCGAATCTAGTCTTGACCCCGGCACCTGAATTGTCAACCCTAGGTCGGAAGCTGTCCCGTGGCATATCAAGAGCCACGGACAATGAGGGTCTGGTCTGGGCCTTGAGAAGCAACACTGAACCTCAACCGAATACCTTATCATCAGACCATATACTGTTATTGCCAGATATATCCGTCTACCCCCCAGATTTTAGAATATTCATCGAAAAAGATCTCCTAGAGACAGCGACTCTAGTCACGTTAGAGTCTGCTAATATACTCAACTGGTGGCGCCACGGTCGCGTTCCGGGCGCCCCACGGTTGCTGCCCCTGGCAACTTCCGGGGACGGCAACTGTCTTCCGCACGCGGCGTCGCTCGCAGCATATGGCTTCCATGACAGATTACTGGCTTTAAGGACCAAAGTCCAAGCTCTGCTCTCTGGAGAATGTGGCGATGTACTTACTAAGGCTATAAAACGTCGATGGCGCTGGTCGGAGAGCATCTCGCTTAGGACAGCAGGGTTGTCACCGTCGGAGGCGGAATGGGAGCGCGAGTGGCAGGACTCTATAGTGGCGGCGTCGGCTGAGCCTAGGCCGCACCAACCCTCAGCAGCTCCGCACTACGCCGGCTTGGAGCAGCTACACGTGTTCGCATTGGCCCACGTCATGAAACGACCGGTTATCGTGTTCGCGGACGTCGCGTTGAGAGATTTCAGAGGCGATCCGATAGCCCCTATCCCATTCGGTGGTATATACCTGCCCTTGGAACTGCAACCGGAAGTGTGCAGTAAAGCGCCGATACTGCTGGCTTACGACGCCGGTCACTTCTCAGCGCTAGTACCGTGCGAGCCTCTTCCCACAGACGGCGCTAGAGTACCCTTGGAAGACCGCATCGGAAACCCAATGCCTATACGATTCAACGTAGACCCGGGCGAGGACTTCCGATGGGACGTTGAACCGGAACAGAAGACTATCAACAACCTGCTCCCAGATGAATACCAACGGTCGGCCATGTTGTCCGCTTATTTGGACTTGGAGAGAGTCGAATGTCTATCTCAGGGTCAACCGTCGGAGGAGCTGAGGAGGTCCCTCGACGCATTGTCCACTAAGAGTTCCAAGCAGTTGAACTCCGTCGCCAAACAATTTGGCAGCATCGGGAGGTCAATGAGCAGCAAGTTGAAGAGCTTCGGATCTATGGCCAAGTTAACGAAGACCGGAAGCCAGTCGAACCCGGAAGACGGTCTGATGCGGCGGCAGTCCACTTGCGAAGTACTGTGCTGTAGGGTCCTAGCGGCTAGAGCGCCCGTACAGGAAGAGATGGTCAAGAATTACCTGAATGAAGCTTGGATCGGATACACAGCGGAAATGAATAGGAAAGAAGAATGCCAGGCTCCAGCGAAGCCTCGCTATGGCACCGGTCGGTCGCAGTTCTACGCGGAAGCTGATAGGAACGCTCATGAGAACGCTAGAACATTGACCACTAAGTCCGCGAAACCGGCCCAAGATCGAACCTTGTACCTCTCCAAATCCACTTTCTACGACGACCGCCCTCCTTCCCCCAAACCCTGCAAGGCCCCGCTCTGTATGTACTACGGCAGCCCCGAAAATAACGACTACTGTTCTCGTTGCAGCAAACTCCAATGA

Protein sequence:

>DPOGS205421-PA
MVMEANPINEKRTDDRVHAIRTRAGGDVIPERSVGTLDLNMSFTPSNLVLTPAPELSTLGRKLSRGISRATDNEGLVWALRSNTEPQPNTLSSDHILLLPDISVYPPDFRIFIEKDLLETATLVTLESANILNWWRHGRVPGAPRLLPLATSGDGNCLPHAASLAAYGFHDRLLALRTKVQALLSGECGDVLTKAIKRRWRWSESISLRTAGLSPSEAEWEREWQDSIVAASAEPRPHQPSAAPHYAGLEQLHVFALAHVMKRPVIVFADVALRDFRGDPIAPIPFGGIYLPLELQPEVCSKAPILLAYDAGHFSALVPCEPLPTDGARVPLEDRIGNPMPIRFNVDPGEDFRWDVEPEQKTINNLLPDEYQRSAMLSAYLDLERVECLSQGQPSEELRRSLDALSTKSSKQLNSVAKQFGSIGRSMSSKLKSFGSMAKLTKTGSQSNPEDGLMRRQSTCEVLCCRVLAARAPVQEEMVKNYLNEAWIGYTAEMNRKEECQAPAKPRYGTGRSQFYAEADRNAHENARTLTTKSAKPAQDRTLYLSKSTFYDDRPPSPKPCKAPLCMYYGSPENNDYCSRCSKLQ-
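
Consistency check (illustrative sketch):

The transcript and protein lengths in this record agree with each other: 1758 bp is 586 codons, which is 585 residues plus one stop codon. The Python sketch below shows one way to check a record like this by translating the coding sequence with the standard genetic code. The function name `translate` and the use of "-" for the stop codon (matching the trailing "-" in the protein sequence above) are assumptions made for this example, not part of the geneset release.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1 (hypothetical helper).

    Stop codons are written as `stop_symbol` so the output can be
    compared directly with a protein sequence that ends in "-".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


# First 30 nt and last 12 nt of DPOGS205421-TA, copied from the record above.
head = translate("ATGGTTATGGAAGCGAATCCCATTAATGAA")
tail = translate("AAACTCCAATGA")
print(head, tail)
```

Passing the full DPOGS205421-TA sequence to `translate` and comparing the result with DPOGS205421-PA would extend the same check to the whole record.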