Monarch geneset OGS2.0

DPOGS204526
TranscriptDPOGS204526-TA2187 bp
ProteinDPOGS204526-PA728 aa
Genomic positionDPSCF300297 - 356715-372570
RNAseq coverage5738x (Rank: top 2%)
Annotation
HeliconiusHMEL0087378e-5288.99% 
BombyxBGIBMGA004309-TA4e-12257.26% 
DrosophilaDek-PB5e-2169.23% 
EBI UniRef50UniRef50_D6WT431e-2478.26%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WT43_TRICA
NCBI RefSeqXP_973504.12e-2578.26%PREDICTED: similar to LOC398543 protein [Tribolium castaneum]
NCBI nr blastpgi|910876555e-2478.26%PREDICTED: similar to LOC398543 protein [Tribolium castaneum]
NCBI nr blastxgi|1947566982e-5828.51%GF11415 [Drosophila ananassae]
Group
KEGG pathway 
InterPro domain[646-699] IPR0148767.6e-17DEK, C-terminal
Orthology groupMCL22064 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204526-TA
ATGTCGGGTGATACCGATAAAGCGAAAATCAGTGACAATCAGGATGAAGACAAGAAATCCGGAACAGGCGACGGGCAGACGACGGAAGACGAATCATCCCAGGACTCGAAGGGTGAGTCGGCGCCAATGGACGCGCAAGGTAACAACGTAGCAGACAATATAGAAGGAGACGCGAAGTCAACGACGGCGAACGGCAAAGAAGATGACTGTCAGGGCGAGGACTCCAAGGATGATAGGGCTGTGAAGACGGAACACGCTAAGAAGGTAATGTACAAAAACGGTGAAACCCACGACGATGACGACAAAGAGGATTTGAAGGAAAATGACAAAGGGGCTAAGAAGTTAGCGCCGAAGAAGAAGCCCAAGAAAGAGGACACCGAAGAAGAGGAAGAAGATGAAGAGGAGGAAGATGAGGAGGAGGAGGATGAAGAAGGAGAAGAGGGCGATGAGGAGGAAGAGCAGAAGACAAAAGAGAAAGTAAAGAAACCAGTCAAGAAACCTAAAGACGGAGAGGAGGAAGGTGACGACGAGGAAGAGGGGGAGGAGGAGGGGGAGGAAGAAGAGGAAGAGGAGGAGGAAGAAGAGGCTCCGAAACCGAAACCCAAAAAGGAGCCAACTGAACCCCCTGTACCACTACCAGCTGGTAAAGGAATACCCTTGGGGCATATCAGCAACGTGGAAGTGTCACTGTCGCGCTTCAAGACCCAAGACCAGAAGATATTACACCAGTATTTGTACGGACAACTCTGCCTGGATCGCAACGTCAAAAGGAACATCAAGAAGTTCAAAGGCTACGAATGGGCGATCGGTTCCACGGAGTACAAAGCTAAACTAGAAGAAACAGCCAAAATGGAGCCCAAACAGTTGAGGACGATGTGCGAGATGTTGGACTTGGACAAAAAAGGTGGCGCCAGTGAGTTGGCTGCTCGTCTGGTCGGTTTCCTTCAGCAGCCGGTCGCGAACTCTCCCCACGCCCGCGGCGTTGCTCGCCCCCCTACCACGCAAGCGGCGACACCCGGCGGCCGACCGAGACGATCAGCGGCCGTCAAGATACACAACAGAGATATTAATATCACGTCTACTGTGGCGGCTGCGATGCCCCCTTCCCGCGCTCCCTGTGTTGTCTACCCCGAGCACACCCTTCCAACAGCCGAGGCCACTCCCCCGGCGGTTTCGCACGTGCCACTCGGACCTCAGTGTTCGTTAAAAAGTGTGCGGCGTCGTAGGTGTCACCACCCGCACCCCTATCACGGCGCGAAACACGTGGTTGTAACGATGTCGTGTAGCTACTCGGACGAAGAGTATGAATCTGATCCGGAGACCAAGGTGAAGGGTCCCAAGCAGCCCAAGGACGGCTCGGAGGACTCTGATGGCTCCTTCAACCCGAGCGGATCTGAGGCGGACTCGGACTTCGACCCTGAAGGTGGTGAGGGTGTGAGCGGAGCCGCGCGCAAGAGGAAGAGCTCTGGACGACGCCGGTCCAGTAAGGGGAAGAGGGGGCGCAAGAGCAAGGGGAGGAAGAAGGGTGGCAGCAGAGGCCGCGGCCGACGGGCACGGTCAGACAGCGAGGACGAGAGTGAACGCTCTGATAGTGACAGCGAATTGGACTCGGCCAGCGACGGAGACGAATCAGATGAACCGAAGTCCAAACGTGGTAGGCCCGCGGGGTCCGTGTCTAAGGGTCGCAAGGGAGCTGTAGCAAAGGCTAGCGCTAAAGCGACTCCCGCTAAGCGGAAAGCGCCTACACCCACAGGGAAGAAGAAGGCCGGTGCCAAGCCAGTCGGTAGACCAGCCAAGAAGGGCAAGCGGGCGTCCTCCGACGAATCTGGAGATGGAGAAGAAGGCAGCGAGGAAGAAGATGAAGAAGGCAGCGAGGAGGAGGAGAGCGGAGAAGAGGATGACGAGCCAACTGACAAGAAAGCCAAGCGTCCACCTACAGACGAGGAGATTAAGAAGTACGTGAAGCAGATCCTGGAGGGCGCGAACCTGGAGCAGATCACCATGAAGACGGTCTGCAAGCAGGTCTACAGCCACTATCCGGACTTTGACCTGGCGCACAAGAAGGACTTCATTAAAGCTACTGTCAAATCGAGACTAATGTTTAAATTAAGCTCATATCGTCGTGAGCGTGTGGTCGGCCGCAACCCTAGCACTAAGCACAGCACTAAGTCCGAAGATCCGGCATAG

Protein sequence:

>DPOGS204526-PA
MSGDTDKAKISDNQDEDKKSGTGDGQTTEDESSQDSKGESAPMDAQGNNVADNIEGDAKSTTANGKEDDCQGEDSKDDRAVKTEHAKKVMYKNGETHDDDDKEDLKENDKGAKKLAPKKKPKKEDTEEEEEDEEEEDEEEEDEEGEEGDEEEEQKTKEKVKKPVKKPKDGEEEGDDEEEGEEEGEEEEEEEEEEEAPKPKPKKEPTEPPVPLPAGKGIPLGHISNVEVSLSRFKTQDQKILHQYLYGQLCLDRNVKRNIKKFKGYEWAIGSTEYKAKLEETAKMEPKQLRTMCEMLDLDKKGGASELAARLVGFLQQPVANSPHARGVARPPTTQAATPGGRPRRSAAVKIHNRDINITSTVAAAMPPSRAPCVVYPEHTLPTAEATPPAVSHVPLGPQCSLKSVRRRRCHHPHPYHGAKHVVVTMSCSYSDEEYESDPETKVKGPKQPKDGSEDSDGSFNPSGSEADSDFDPEGGEGVSGAARKRKSSGRRRSSKGKRGRKSKGRKKGGSRGRGRRARSDSEDESERSDSDSELDSASDGDESDEPKSKRGRPAGSVSKGRKGAVAKASAKATPAKRKAPTPTGKKKAGAKPVGRPAKKGKRASSDESGDGEEGSEEEDEEGSEEEESGEEDDEPTDKKAKRPPTDEEIKKYVKQILEGANLEQITMKTVCKQVYSHYPDFDLAHKKDFIKATVKSRLMFKLSSYRRERVVGRNPSTKHSTKSEDPA-