Monarch geneset OGS2.0

DPOGS210522
TranscriptDPOGS210522-TA1797 bp
ProteinDPOGS210522-PA598 aa
Genomic positionDPSCF300186 + 248132-253319
RNAseq coverage477x (Rank: top 26%)
Annotation
HeliconiusHMEL0163400.076.22% 
BombyxBGIBMGA012625-TA0.060.51% 
DrosophilaCG5645-PA4e-9745.10% 
EBI UniRef50UniRef50_D6WTE22e-11647.13%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WTE2_TRICA
NCBI RefSeqXP_973472.21e-11647.13%PREDICTED: similar to CG5645 CG5645-PA, partial [Tribolium castaneum]
NCBI nr blastpgi|3504198103e-11644.44%PREDICTED: protein KRI1 homolog [Bombus impatiens]
NCBI nr blastxgi|2700102582e-15044.79%hypothetical protein TcasGA2_TC009637 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[3-577] IPR0180343.2e-147KRR1 interacting protein 1
[272-365] IPR0078511.1e-24KRR1 interacting protein 1, subgroup
Orthology groupMCL12978 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210522-TA
ATGACTAAAAAAAAGAAGCTTTTTGATGAGGAATCTGATGAGGAAGTTACCTTAAAAACAGAAAACGAATACGCCAAAAAGTACGACACCTGGCGAGAAAAAGAAGAATTGTATAAACTGGAACAAAAATATGGTTCCAAAGCACTTAACTCGGATGCGTCTGTCTCTTCTGACAGTGAAGACGAAAGTGACGAGGCTCCAGAAATATCGGAGGAAGTGGAGAAGCAGTTCCTGAAGACCCTATCACTGTTGAAAACCAAGGACCCTAGAATCTATGACCCCAATTACAAATTCTTTGATGAGAGAGTGGAGAAGGAAAAAGAAAAGGAACCAGAGACTAAAAAGTTAACTTTTGGTGAAAGTGATGATGACGAAGATGATGGAAATATCTTTAGCATTGAGAAGAAGGCGGATATAGAGGACCCAGGGGGTGGTGCTCAGGAGCTGAAGGATCAAAAGAGTGACCTCAAGGCATTCCTGACGGGGAGTGTAGAGCATGTGGAGGATGAGGCTGAGCTGGCTCCGTTGCGTGCTCTGTGGAGCGACCCAAACCTCAACGAGGGAGAGGCCTTCCTGAGAGACTACATACTGAATAAAAGATATCTCGAGGACGGTCCAGCAGCAGCGAGCCAGCTCCGAGACGACGAGGAGCTGGAGGAAGACGAGAAGAGAGTCGAGGAACAGGGGCAGTTCGAGAGAGCCTACAACTTCCGCTTCGAGGAACCCGACCAGGAGTTTTTGAAACGCTTTCCCCGCACAATGAACCACATCCGGCCCAAAGATACGAGCAGAGCCAAAAAACGAGCCGAAGTGAAAGAGAGGAAAGAGAAGGAGAAACAGAGGAAGATGGAAGAGATCACGAGGATGAAGGCGCTCAAGTTGAAGGAAATCAAGGAAAAGATCGCTAGGATCAAGGAAGTCACCGGCAACGAGGAGCTGGCGTTCAGGGAGGAGGACATAGAAGGCGACTTCGACCCCGAGGAACATGACAGAAGAATGAAGGCTCTGTTTGATGACGAGTACTACGGAGATGTGGACGAACAGAAACCAGTGTTCCCTGACCTGGACGAGGAGCTGGAGATCGAGAACTGGGATAAATACGAGCATGAAGAGAACGCTCCTGACGAGAATGAGCATGACGGACCACACTGCGAGGACGAGGACTTTAATATGGACGCAGACTACGACCCGAAGAAGGCGAGGGAGAGTTTACTGGAGGAACTGACGAGCAACATGGCCAAGAAGAAACGGAACAGGAAGAAGAAGTCCAAGCTGGCCGAGCTGCTCTCGGAACAGAAACCTAAGTTCGTACCGGAGGTGGACAAGACATACTCCCAGTACATGGAGGAGTATTACAAGATGGACTGCGAAGACATCATCGGAGGGGACCTGCCCACCAGGTTCAAGTACAGGCAGGTCGTCCCCAATAACTACGGCCTGACTGTGGAGGAGATCCTTCTGGCGGACGACAAAGAGTTGACCCAATGGGTCCCTCTCAAGAAGATCGTCAAGTACAGACCAGAGAATGTTGAGAAGAGTGAAGTTCACTCATACACACAGAAAGCGGCCGACGAGAGGCTCAAGAAGAAGATACTGCCAAGTCTGTTCCGAGGAGTGCCGGATGAACCAGAAATAGTTGTCCCATTAGAGAAGACTATCAAAAAGAAGAAAAAGAAGAAGAAGAAACAAGAAGATGTAGAAAATAATGAAATTGATAATGAGGGGTCTAATGATATTGATAATAATGGAAATAATGATAGCGTGGAAGATAGTGATGACGAAGAAGAAAAGGAATAA

Protein sequence:

>DPOGS210522-PA
MTKKKKLFDEESDEEVTLKTENEYAKKYDTWREKEELYKLEQKYGSKALNSDASVSSDSEDESDEAPEISEEVEKQFLKTLSLLKTKDPRIYDPNYKFFDERVEKEKEKEPETKKLTFGESDDDEDDGNIFSIEKKADIEDPGGGAQELKDQKSDLKAFLTGSVEHVEDEAELAPLRALWSDPNLNEGEAFLRDYILNKRYLEDGPAAASQLRDDEELEEDEKRVEEQGQFERAYNFRFEEPDQEFLKRFPRTMNHIRPKDTSRAKKRAEVKERKEKEKQRKMEEITRMKALKLKEIKEKIARIKEVTGNEELAFREEDIEGDFDPEEHDRRMKALFDDEYYGDVDEQKPVFPDLDEELEIENWDKYEHEENAPDENEHDGPHCEDEDFNMDADYDPKKARESLLEELTSNMAKKKRNRKKKSKLAELLSEQKPKFVPEVDKTYSQYMEEYYKMDCEDIIGGDLPTRFKYRQVVPNNYGLTVEEILLADDKELTQWVPLKKIVKYRPENVEKSEVHSYTQKAADERLKKKILPSLFRGVPDEPEIVVPLEKTIKKKKKKKKKQEDVENNEIDNEGSNDIDNNGNNDSVEDSDDEEEKE-