Monarch geneset OGS2.0

DPOGS210525
Transcript          DPOGS210525-TA    1320 bp
Protein             DPOGS210525-PA    439 aa
Genomic position    DPSCF300186 + 261588-263066
RNAseq coverage     1352x (Rank: top 9%)
Annotation
  Heliconius        HMEL016343          0.0       87.53%
  Bombyx            BGIBMGA012627-TA    6e-175    70.55%
  Drosophila        CG10565-PA          8e-124    52.87%
  EBI UniRef50      UniRef50_E2ARS6     5e-125    56.35%    DnaJ-like protein subfamily C member 2 n=13 Tax=Pancrustacea RepID=E2ARS6_CAMFO
  NCBI RefSeq       XP_966597.2         6e-148    61.45%    PREDICTED: similar to MGC89351 protein [Tribolium castaneum]
  NCBI nr blastp    gi|378466365        0.0       76.71%    DnaJ-23 [Bombyx mori]
  NCBI nr blastx    gi|378466365        0.0       76.71%    DnaJ-23 [Bombyx mori]
Group
Gene Ontology
  GO:0005515    2.2e-13    protein binding
  GO:0003677    2.8e-13    DNA binding
  GO:0006355    1.9e-09    regulation of transcription, DNA-dependent
KEGG pathway
InterPro domain
  [376-435]    IPR009057    2.2e-13    Homeodomain-like
  [375-427]    IPR001005    2.8e-13    SANT domain, DNA binding
  [376-424]    IPR012287    1.9e-09    Homeodomain-related
  [377-423]    IPR014778    4.4e-09    Myb, DNA-binding
Orthology group     MCL15214    Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210525-TA
ATGAATGCGCGGTGGTCAGAGAAAAAACATGTGCCTTTGTTGGGAGACGAGAACAGTTCTCGGGAGTTTGTAGAAAAATTTTACTCTTTTTGGTATGAATTTGACTCGTGGCGTGAGTTTTCTTATTTGGATGAGGAAGAGAAAGAGAAGGGGTCTGACAGAGAGGAGCGGAGGTGGATAGAGAAACAAAACAAAGTGGCCAGAGCTAAACTGAAGAAGGAAGAAATGACGAGAATACGCAGTCTTGTTGACTTAGCGTACGCTAACGACCCCAGGATACAGAGATTCAAACAGGAAGACAAAGATAAAAAGATAGCTGCGAAACGTGCCCGCCAGGATGCAGTCCAAGCTAAAAAAGCTGAGGAAGAGAGATTAATTAAAGAGGCCCAAATTGCTAAACAAAAAGCTGAGGCGGCGGAGAGGGCGAGGATGGAGGCGGCTCGAGCTGAGAGGGAACTGCAAAAGAAGAATCTGCGCAAGGAACGAAAGTCACTTAGAGATTTATGTAAAAGTAAAAACTATTTTGCTAAAAACGAAGACGAAACCGTCAGTAATATGGCAGCCGTCGAAAAGATTTGTGAACTGTTGAAAGCCACAGAGATCCAAGCCCTGATCAAAGATATCGAGTCTAGTGGCCGGGATGCATTCATAAAGGCTATCACAGAGTCCGAAGAGAAGCTTGAAGCTGAACGCAGGGCTTTGTTTGAAAATAAAAGAGCTGAGGAGCAAAAAGCAAAGAAAAATGCAGCCCTTAAGGTTCCCATAGAATGGTCTCCCGAAATGATGCAGTTGCTCATCAAAGCTGTCAACCTATTCCCTGCCGGTACAAATGCAAGATGGGACGTCGTCGCTAACTTCCTGAATCAACACGGAACATTTACTGATGAAAGGCGTTTCAATGCTAAAGAAGTTTTAAATAAAGCTAAGGACTTGCAGAGTTCAGATTTCTCGAAGAGCATCCTAAAGAAAGCTGCGAATGAAGAAGCTTTCGATCAGTTCGAAAAAGACAAAAAGAAGGTTGTCAACTCGGTGGATGACAATAGTATATCCAAGAATGACACTCCCAAATTAGTGAATGGGATCTCCAAACCTAAAATGAATGGGGACGTCAAGGAATCCAAGGAAGAAAAGCCTTGGACCAAGACCGAGCAGGAACTTCTGGAGCAGGCCATCAAAACATTCCCCGTCAGCACTTCGGAGAGGTGGGACAAGATCGCCGAATGTATACCGAACCGCTCCAAGAAAGACTGCATGAAGAGGTACAAAGAGCTGGTAGAATTAGTCAAAGCCAAGAAACAAGCGGCCAACATCTCGAAATAG

Protein sequence:

>DPOGS210525-PA
MNARWSEKKHVPLLGDENSSREFVEKFYSFWYEFDSWREFSYLDEEEKEKGSDREERRWIEKQNKVARAKLKKEEMTRIRSLVDLAYANDPRIQRFKQEDKDKKIAAKRARQDAVQAKKAEEERLIKEAQIAKQKAEAAERARMEAARAERELQKKNLRKERKSLRDLCKSKNYFAKNEDETVSNMAAVEKICELLKATEIQALIKDIESSGRDAFIKAITESEEKLEAERRALFENKRAEEQKAKKNAALKVPIEWSPEMMQLLIKAVNLFPAGTNARWDVVANFLNQHGTFTDERRFNAKEVLNKAKDLQSSDFSKSILKKAANEEAFDQFEKDKKKVVNSVDDNSISKNDTPKLVNGISKPKMNGDVKESKEEKPWTKTEQELLEQAIKTFPVSTSERWDKIAECIPNRSKKDCMKRYKELVELVKAKKQAANISK-
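
The transcript and protein lengths in this record can be cross-checked with a little arithmetic: 1320 bp is 440 codons, and dropping the terminal stop codon leaves 439 aa. The following is a minimal Python sketch of that check, not part of the record itself. It assumes the standard genetic code and that the trailing "-" in the protein sequence marks the stop codon; the function names `expected_protein_length` and `translate` are hypothetical helpers introduced here for illustration.

```python
# Illustrative consistency check for the record above.
# Assumptions: standard genetic code; the transcript is a complete CDS
# that ends in a single stop codon.

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def expected_protein_length(cds_bp: int) -> int:
    """Return the residue count implied by a CDS length, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


def translate(cds: str) -> str:
    """Translate a nucleotide string codon by codon; stop codons become '*'."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        first, second, third = (BASES.index(base) for base in cds[i:i + 3])
        residues.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(residues)


if __name__ == "__main__":
    # Lengths copied from the record: 1320 bp transcript, 439 aa protein.
    print(expected_protein_length(1320))   # 439
    # First five codons of DPOGS210525-TA.
    print(translate("ATGAATGCGCGGTGG"))    # MNARW
    # Last two codons of DPOGS210525-TA.
    print(translate("AAATAG"))             # K*
```

The first five codons translate to MNARW and the last two to K followed by a stop, which agrees with the start and end of DPOGS210525-PA as listed.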