Monarch geneset OGS2.0

DPOGS203861
Transcript        DPOGS203861-TA  2295 bp
Protein           DPOGS203861-PA  764 aa
Genomic position  DPSCF300010 + 3526938-3536770
RNAseq coverage   1048x (Rank: top 12%)
Annotation
Heliconius      HMEL010314        0.0    64.17%
Bombyx          BGIBMGA009668-TA  0.0    59.92%
Drosophila      CG7946-PA         4e-26  30.80%
EBI UniRef50    UniRef50_E2B603   3e-77  34.88%  Hepatoma-derived growth factor n=8 Tax=Formicidae RepID=E2B603_HARSA
NCBI RefSeq     NP_001040548.1    7e-71  79.31%  hepatoma-derived growth factor-related protein 3 [Bombyx mori]
NCBI nr blastp  gi|307213442      1e-76  34.88%  Hepatoma-derived growth factor [Harpegnathos saltator]
NCBI nr blastx  gi|307213442      3e-81  32.62%  Hepatoma-derived growth factor [Harpegnathos saltator]
Group
KEGG pathway 
InterPro domain  [407-527]  IPR021567  6.7e-22  Lens epithelium-derived growth factor (LEDGF)
                 [7-68]     IPR000313  1.2e-12  PWWP
Orthology group  MCL15576  Insect specific

Nucleotide sequence:

>DPOGS203861-TA
ATGGGTAAAAAAGTTCGTGAGTATAAGTCCGGCGACTTTATATTTGCAAAAGTAAAAGGATATCCAGCTTGGCCGGCGAGGGTTCAAAGACTAAATGGGAAAAAGTATTTTGTATATTTCTATGGAACTGGTGAAACAGCCAACCTCCCTCCGAACATGATATTTGACTATGCAGAAAATAAAGATAAGTTTCTAACGAAAACTGTAAAAAGACGTGATTTCAATGACGGTGTTAAGCAAATAGAATATGACTTCGCCAATAATGTGCCGCTGGAACAAGTTATTGGGCTCGTCTCTGAAACTCCAGAAAATGTGAATAACACTATCAATGATACATTAAATGAAACAGCCGATGAAACTATGGCTGATACCACAGCAGATGAGAGCATAGCTGAAGAGAACACCACCCAAAATGACACAGCAATTGAAGATTCTGATGAAACAGGAGGCCTAATAATTGATGAAGGAGGCAAGCCAAAACCAGCGAAACGTGGTGCGAAAACACCGGCTAAGGAAACACCGAAACCCAAAGAAGTTAAAACACCAAGGAGGGGCAAGAAGGAAGAGACAGAGATAAAAGATGAAAAGAGGGACGAGGAGATTGTTAGTAGGAGTGGAAGGAAAATAAGACCTAAGAGGTACATAGATGAGCATACGGAGGAAAATTCAACACTACCGTCACCAGCGCCGAAGAAACGTAGGGCTTCGCCAATAGAGAACGAAAAAGAAAATACTCAAGAAAAAGACAGCGTCAAACAGTTCAATGTTGTGACACAGAATGAGCTTGAGGATTTGAAAGAGCCGTTTGCCCCAGGCGATCTGTTAAGTTCTATGATATTAAGTCAATACGACGACAATTTCGCACAGAATTTACTATTGGACTATAATCGCAACGAAGATGACGTGAATATAAGTACGGAAGATCCAGAAAAGGACACTATAATAATCACGTATCTCCCATCCGGTCAGTATGTCGGTATAAAGTTGTTCCAATCGAGGCCCAGTTTCAAGAATGAGGCCTCAAGGCTTCAGTGGGACAAACAGGCGGCAAGCAATGCACTCACATTGAAGATGCAATTAGAGAAAGGTCAGATTACAGCACAGTCTGTGATCGCTCAGTTAGTTATGGATCTCAACTTGTCCGATCAAGAAAAGGCAATGTTTGACAAGGAAAGGGAAACAGAGGAAAAGAAATCTCGTGTACAATTTCTAAAGACTGAAATGAAGCTCATAGAGCTCGATGCCAAAATCAAGACATGTCTCTGTTTGGAGAAAGCTGATACAGAATTATGTTTGAAGTTGCTCGATGAACTTATGGAACTTGAATTAAAGCCGCTCATGCTGTTGAAACATCCATCATGTCTGGAGACTATCAAACGCATGAGGGCATATGTTGGCAACACTCCATCGTGGGAATTAAGCGAAGAGGCCGTGTTACAGTTCAGCCAACACGCTGGCAAGATAAGGAGACAGGCGGACGTTTTGTATAACAATATGAAGGAACTGTTCCCGACACTCGAAGGGTTATCGTTTTGGGAGTTCTTCACAGAACGTGTCAGTCAATTCAAAAAGGCAACATCTAAACTGAGCTCTGATGAACTGCTGGAATTAGTTCACGAGCCTTTGGAAATGTCGGTACCAACATCACACACAATGAAGTCGGCTGTTGAAGCTGCGAACGAAGATGAAAACGAAGAGTCGAAGAAAAAACCTCCAGTCAAATCAAAGAAAGTTAATAGCACTCCCTCAAAACCGCCGTTAAAACGACAATCGTCGAGGAAACAGCAGCAGGATGAGAAGGAAAAGGAGAAAGAGACAAAACCTGAAGAACAAATAGAGAATACAAAGAAAGAAGAAGAGAAAGATGTAAAAGATAAAGATGACAAAGATAGTAGCGAAATAGAAAAAGATAGTAAAGAAAACGAGATTACACACACGGAGAAGTCAGAGGATGCTAACACAGAAGTTAATGAAGCAAAAGATAAAGATAAAGAAGAAGT
AAACGAAAAGGAAGCCGCCGATGAGACCCAAGATAACGCAGAAAAGGAAAGTGAAAAAGACAAAGCACAAAACGATTCAGATAAAGATAAACAAGAAAGTACAGAGTCAGATAAAGATAAAAACGATAAAGATGTTAAAGAGGTCAAAGGAAAGAACAGCAAAGAAGACAAGAGTGAAAAGAAAGAAGAAAAAGAGAAAGTCGAAGATGAACCGAGAGCGAAACGGACGAGAGAGAGTAAGAAGACTGATCCGCCGCCGCGTTCACCGACGAAAAGAAAAGCCAAAATGAATTGA

Protein sequence:

>DPOGS203861-PA
MGKKVREYKSGDFIFAKVKGYPAWPARVQRLNGKKYFVYFYGTGETANLPPNMIFDYAENKDKFLTKTVKRRDFNDGVKQIEYDFANNVPLEQVIGLVSETPENVNNTINDTLNETADETMADTTADESIAEENTTQNDTAIEDSDETGGLIIDEGGKPKPAKRGAKTPAKETPKPKEVKTPRRGKKEETEIKDEKRDEEIVSRSGRKIRPKRYIDEHTEENSTLPSPAPKKRRASPIENEKENTQEKDSVKQFNVVTQNELEDLKEPFAPGDLLSSMILSQYDDNFAQNLLLDYNRNEDDVNISTEDPEKDTIIITYLPSGQYVGIKLFQSRPSFKNEASRLQWDKQAASNALTLKMQLEKGQITAQSVIAQLVMDLNLSDQEKAMFDKERETEEKKSRVQFLKTEMKLIELDAKIKTCLCLEKADTELCLKLLDELMELELKPLMLLKHPSCLETIKRMRAYVGNTPSWELSEEAVLQFSQHAGKIRRQADVLYNNMKELFPTLEGLSFWEFFTERVSQFKKATSKLSSDELLELVHEPLEMSVPTSHTMKSAVEAANEDENEESKKKPPVKSKKVNSTPSKPPLKRQSSRKQQQDEKEKEKETKPEEQIENTKKEEEKDVKDKDDKDSSEIEKDSKENEITHTEKSEDANTEVNEAKDKDKEEVNEKEAADETQDNAEKESEKDKAQNDSDKDKQESTESDKDKNDKDVKEVKGKNSKEDKSEKKEEKEKVEDEPRAKRTRESKKTDPPPRSPTKRKAKMN-
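
The record reports a 2295 bp transcript and a 764 aa protein, and the protein sequence ends in "-". A minimal sketch of how those figures can be cross-checked is given here, assuming that the transcript is coding sequence only (no UTRs), that the standard genetic code applies, and that "-" marks the stop codon. The names `translate` and `lengths_consistent` are hypothetical helpers written for this illustration and are not part of any MonarchBase tooling.

```python
from itertools import product

# Standard genetic code in TCAG order (assumption: no alternative code is needed).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence, writing stop codons as stop_symbol."""
    # Drop line breaks and other whitespace, as found in wrapped FASTA records.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript holds exactly protein_aa codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First 30 nt and last 6 nt of DPOGS203861-TA, copied from the record above.
print(translate("ATGGGTAAAAAAGTTCGTGAGTATAAGTCC"))  # MGKKVREYKS
print(translate("AATTGA"))                          # N-
print(lengths_consistent(2295, 764))                # True
```

Passing the full DPOGS203861-TA sequence to `translate` would be the complete check. Only the two ends are shown here to keep the example short.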