Monarch geneset OGS2.0

DPOGS202412
Transcript: DPOGS202412-TA; 1476 bp
Protein: DPOGS202412-PA; 491 aa
Genomic position: DPSCF300233 + 30272-33264
RNAseq coverage: 450x (Rank: top 27%)

Annotation
Heliconius: HMEL003690; 1e-90; 67.90%
Bombyx: BGIBMGA003438-TA; 0.0; 69.90%
Drosophila: CG4069-PA; 1e-93; 40.74%
EBI UniRef50: UniRef50_G3MHD5; 2e-110; 43.23%; Putative uncharacterized protein (Fragment) n=1 Tax=Amblyomma maculatum RepID=G3MHD5_9ACAR
NCBI RefSeq: XP_001600405.1; 2e-111; 44.99%; PREDICTED: similar to kelch repeat protein [Nasonia vitripennis]
NCBI nr blastp: gi|383866033; 6e-111; 44.47%; PREDICTED: kelch domain-containing protein 4-like [Megachile rotundata]
NCBI nr blastx: gi|156541487; 4e-120; 44.81%; PREDICTED: kelch domain-containing protein 4-like [Nasonia vitripennis]

Group
Gene Ontology: GO:0005515; 1.5e-45; protein binding
KEGG pathway: mcc:698140; 4e-12
    K01787 (RENBP) maps-> Amino sugar and nucleotide sugar metabolism
InterPro domain: [403-462] IPR015915; 1.5e-45; Kelch-type beta propeller
    [181-226] IPR006652; 6.5e-08; Kelch repeat type 1
Orthology group: MCL13405; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species
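
The genomic position and transcript length listed in this record can be compared with a short calculation. The sketch below is illustrative only: `parse_position` is a hypothetical helper, the "scaffold strand start-end" layout is inferred from this single record, and the coordinates are assumed to be 1-based and inclusive. Under those assumptions the gene spans more bases on the scaffold than the transcript contains, which would be consistent with intronic sequence, although the record itself lists no exon boundaries.

```python
import re


def parse_position(text):
    """Split 'SCAFFOLD STRAND START-END' into its parts.

    The layout is an assumption based on this record, not a documented format.
    """
    match = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized position: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)


scaffold, strand, start, end = parse_position("DPSCF300233 + 30272-33264")

# Assumes 1-based, inclusive coordinates.
genomic_span = end - start + 1
transcript_length = 1476  # bp, from the Transcript field

print(scaffold, strand, genomic_span)        # DPSCF300233 + 2993
print(genomic_span - transcript_length)      # 1517
```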

Nucleotide sequence:

>DPOGS202412-TA
ATGGGTAAAAAGAAAAATAAAAATAAGGTTAGCGGTGCAGTCAAAACTGCTGCCAAAACAGAGAAGAAGTTGGCTAATAAATTAAAAAAGGAACTTGCGAATTTAGGCGAGGAAGATATAGCAAAAGTAATAGCCGAAATCGAGAAAGAAGAAGCGAAACGCGCGGCAGCTTCTGAGAAGATATTGTCTCAGCCGCCTTCTGCTCGTGCCTACGCAAGCCTGACGCCGCATCCCACTAACAATGAACTTATTATGTTTGGAGGAGAGTTTCACAATGGACAACAGACAGAAGTTTACAATGAATTATTATTCTTCAATCCTGTCACTGGTGTTTGGAGACAAGTGAAGGCACCTGGCGCTCCACCTCCAAGAAGTGCACATCAGGCTATTGCCCTTCCATCAAATAAGGGAGAGCTCTGGATCTTCGGGGGAGAATTCACTAGTCCGACAGAAACTCAATTTCATCATTACAAAGATTTATGGTGTTTCTCACTTGCTGAGAAGAAATGGGAAAAGGTCATAGCGCCAAACGGCCCTTCTCCACGTTCCGGTCACCGTATGGTTCAGCTTGGACGGAAGCTATACGTCTTCGGTGGATACAATGATGATGGACGCGAGTGCAGATACTTCGACGACCTCTACCAATTCTGTCTCGACACACGAACGTGGACGAAACTCACCGTCAGTGGTAGGGGACCCTGCGCGAGATCCGCTTGTATTATGTTGCCCGTGGGGAATGAAGCCCTTATAATCTACGGAGGTTTCTCAAGAGTTCGTGAGGGACGGACTGAACACACACAAACGCATACAGACATGTTCAAATTGAGCGTCAAAGGCACCGCACATACGTGGCGATCACTCTCCGGCGGGAATAAGGCCCGCGCCGGACTGGCGGCGGCGGTTAACGTTCACAGTAACAGAGGATATGTTTTTGGAGGAGTGTGTGATGTTGAAGAAACCGAAGAGGAACTTCGTGGTGAGATGAGTGACGAGCTTCAAGTATTGGATTTAGAGACATTTAGGTGGCATCCAGTACTATTGAAAACACAGACACAAGCACAGAGCGCACCCGCACAGAGTGCGCCAGCACAGGATACCCACGGTTCAGATGATACTAAGGAATCTGTTACAGTGGTAACAGATGATGTGTTCACTATGAAGCTTGGTACGGCCCCAAGCATACAGTCGCCCACACCAGTTCAGCAAACAGAAAGAGTTGCTTCGGGTCCGACCGCTCGTATGTCTGCTATGATGACAGTCCAACGATCAACACTTTATGTGTACGGAGGTGTCCTTGAAAAAGATGACAAACAGTTCTATCTCGGAGATATGTACAGCCTAGATTTGCACAAACAGAATGAATGGAAGACAATCATAGAACAACCATCGCTTCCAGACTGGCTCGGAACGGATTCAGAATATGAGACGAGCTCGGACAGTGGAACAGAAGACAGTGATGACTCCGATGATGAATAG

Protein sequence:

>DPOGS202412-PA
MGKKKNKNKVSGAVKTAAKTEKKLANKLKKELANLGEEDIAKVIAEIEKEEAKRAAASEKILSQPPSARAYASLTPHPTNNELIMFGGEFHNGQQTEVYNELLFFNPVTGVWRQVKAPGAPPPRSAHQAIALPSNKGELWIFGGEFTSPTETQFHHYKDLWCFSLAEKKWEKVIAPNGPSPRSGHRMVQLGRKLYVFGGYNDDGRECRYFDDLYQFCLDTRTWTKLTVSGRGPCARSACIMLPVGNEALIIYGGFSRVREGRTEHTQTHTDMFKLSVKGTAHTWRSLSGGNKARAGLAAAVNVHSNRGYVFGGVCDVEETEEELRGEMSDELQVLDLETFRWHPVLLKTQTQAQSAPAQSAPAQDTHGSDDTKESVTVVTDDVFTMKLGTAPSIQSPTPVQQTERVASGPTARMSAMMTVQRSTLYVYGGVLEKDDKQFYLGDMYSLDLHKQNEWKTIIEQPSLPDWLGTDSEYETSSDSGTEDSDDSDDE-
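
As a consistency check between the two sequences above, the first and last codons of DPOGS202412-TA can be translated and compared with the ends of DPOGS202412-PA. This is a minimal sketch that assumes the standard genetic code and follows this record's convention of writing the stop codon as `-`; the names `translate` and `CODON_TABLE` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon; stop codons become '-'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    return "".join(residues).replace("*", "-")


first_codons = "ATGGGTAAAAAGAAAAATAAAAATAAG"  # first 27 nt of DPOGS202412-TA
last_codons = "GATGATGAATAG"                  # last 12 nt of DPOGS202412-TA

print(translate(first_codons))  # MGKKKNKNK
print(translate(last_codons))   # DDE-

# 1476 bp is 492 codons: 491 amino acids plus one stop codon.
print(1476 // 3 - 1)            # 491
```

Both translated fragments match the corresponding ends of the protein sequence, and the codon count agrees with the 491 aa length given in the record.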