Monarch geneset OGS2.0

DPOGS200462
TranscriptDPOGS200462-TA5292 bp
ProteinDPOGS200462-PA1763 aa
Genomic positionDPSCF300260 + 202770-220800
RNAseq coverage345x (Rank: top 34%)
Annotation
HeliconiusHMEL0145640.056.58% 
BombyxBGIBMGA011407-TA0.050.52% 
Drosophilaegg-PA6e-10148.26% 
EBI UniRef50UniRef50_E9J2540.039.38%Putative uncharacterized protein (Fragment) n=1 Tax=Solenopsis invicta RepID=E9J254_SOLIN
NCBI RefSeqXP_001603698.10.038.43%PREDICTED: similar to histone-lysine n-methyltransferase [Nasonia vitripennis]
NCBI nr blastpgi|3504036200.039.40%PREDICTED: histone-lysine N-methyltransferase SETDB1-like [Bombus impatiens]
NCBI nr blastxgi|3504036200.039.62%PREDICTED: histone-lysine N-methyltransferase SETDB1-like [Bombus impatiens]
Group
Gene OntologyGO:00056342.7e-18nucleus
GO:00082702.7e-18zinc ion binding
GO:00349682.7e-18histone lysine methylation
GO:00180242.7e-18histone-lysine N-methyltransferase activity
GO:00036776.8e-18DNA binding
GO:00055151.1e-09protein binding
KEGG pathwaynvi:1001200140.0 
 K11421 (SETDB)maps-> Lysine degradation
InterPro domain[1413-1524] IPR0077282.7e-18Pre-SET domain
[1312-1431] IPR0161776.8e-18DNA-binding, integrase-type
[1411-1516] IPR0036062.6e-16Pre-SET zinc-binding sub-group
[1339-1395] IPR0017399e-10Methyl-CpG DNA binding
[1543-1687] IPR0012141.1e-09SET domain
Orthology groupMCL11836 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200462-TA
ATGGCATCAAAACAAAACATGGAAGATGAAGAAAATCTCGTTGGAAAAAAAGAGCTTGATGATACGACGAATGTTAATGATCAAAAAGACGGAGAGGTGTACGAAAGCGTTGATGATGATATGGAATTAAAGTGGGAAGATGATGATATTGACGACGTATCGATCACAAATGAAGACGCGCTTCTCGAAGATGTAGCTATGGACAATGATGATAAATTATTACCTTCCGACTCTGTAATACCGGTAGCTAGCCAAGAAAGTATCACGTATGAGATCAATCCTAATGAATTTATAAATAAAGCGAATTTAGATGAAACGTTTGAACCGGCGAAAGCGCATGGCTTAAATAAACCCGACATGATGCTGGAGATACTAACTCATAACCTTAGCGATTTAAGTGACGATGAAGATCTTACAAATCTTAAGATGTCGCCTGACGTCGATTTGGAACGGTGCAGTCCTAACAAAGATTCACAGATTGAAAATAAAGCTGACTTCATGGAAACTGATTTGAATGATGATTTTGATAGAATTTCATCACTAGTTCATGAAAATGTTGATGATGACTTAGATAAAGGCGCAGATATTTCAATGTGTGAAGACAGCAAGCCCGCAGAGACATCACTATCTAGGAAACTTAGTGTGAACGATGACTCCATTGATGAAGATATTCTACTTGCAGATGATGATAAAGATGAACAGGATGAAGGTGGAATGGAAGAACTATTGGATGATAAGATTGATTTGGATGCTGTTGATATATTAGAGATTAATTCTGAAGAGAAGTTGGAATTAGAAAGTGAGAAAAATAAGCTATTACAAAATATTCCCGATACAGATGGTCTAAAACTTAATAATGAGGAAGACGTGTCAGGAATTAAAGCAGAGTGTGGAGCTGAATGTAAAGAAGTAGACGAGATAATAAACGTCCCGAGTCCAAAACCAGAGATATCAGAAGCACATGTACCTAAAGTTATAAATGCAGATCAATCCAATGATTCACCAACTTCAGAAAAGGAAGTTCAGGTCTCAAAAACTGGTATAAAAAGGAAAAAATTATCTTTGAGACTGAGATTGGATAAAACACAGAGCACAGGATCTGATGTGATTATGGATGAACACACTTCATTGAAATCTGATGGATCAGAAGTTGCATTAAGGGACCCGGGAAATGATCAAATTAATACAAATAATGCTCACTTGACCTCCGAAGATGTAGCAAATCAACCAGACATTGCAAACAAAGATTTAATTACCGCAGCAGACTTGGAATCCGAACATACTAAGAGAAAGAAAGATTCCGAGGACACTAGTTCACAACTACCAGATAACGATGTGTCATTAAAACCTAAAAAAGACCTGAAATCTAAGGAGAAGAAATTAACCCCTGATATAGAACCGCAGCCATCGACGAGTGGTTCGAAAAACATTAAAATTAATATTGAATCAGCGTCAAAGAGTGACAGTCGACTAACTTCAAACATCTCTAAACTAAGTTCACCGGCAGAAGACATTCCTGGAACAACTGATAATCTTGACCTCCTGGCTGAATCGTCGCGCGTGACACATGACGATGAAGCAGAAGATGAATATATGGATGACGAGGAGGGAGAGGATTTTGAGCAGTTTGACGAAAGCAGCAATCAGATGGCAGCGGAGCAGTCCGAGGATTCAGAGCAGCATCACTCGGATAACGCTCACGAGACAACACACAGTAATGAAAAGGAATTCAGCTTTACCATCACTGATGTCGTCACTGAAAATGTAGTTAAGGTGGACATTGAGAATCAAGACAGTATTAAGTCAGAGAATTTGGAATCAGTGCCGATAGGAAATGTGTGTCAGAACATGGACGTGTCCAAGGTTGGAGGAATAGAAACCGATTTTGGAAATGAGAATAAAAAGTTGACATCGGAAGAAGAAAAGAATCTGGATTATCAAGACGAGACAAAAGATTGCACCGATGTGAAAAAATCAGAGGCCCTGAGCTATGTTGAGTTAGAGGAGAGCTCGGAAGAAGGTAATCATGAGTTAGATGCTCTAGACGCAAATAAAACTGATGAAGTAGGCAATATGAATACAACACAAGATATCAACGAAGATGAGCCAGCCAAGACAGAGGACACGGGACTGGATGACAGTAACACACTAGTTGAGGATACTACGAACCAAGATACAAAAGCATTGGATAAAGATGACAAGACGAAATCCCAAGGTTTGGAAGTATTCAATCTAGACTCGGACGAGGAGGATGTTGGTGAAAAGAATAAAACGGACATTTCCCATCAAGAAACCCCTGAGAATCCGAAGCCCCAATCCCAGTGGGTGAAGTGCATCAACAAGTCCTGTGCCAACACATCGTCAGACTATTACAAGGCTGACGGCATCACAGTCAACTTCTATGACCCGGAGAGAAAGAAAAGAGGCTATGTTTGCCAAACCTGTCTCAATTTGGTGGAAGAGAGGAATCAGTTGTTGATCAGCGGCATCAAGTCCCTGGTGCCGCTGCTGAAGCTGGAGCCCGGCCGGCCGGAAGAGGATCTGGTCGAGATATCAGACTCGGAGTCCGAAGACGAGGCGGAGCCGGAGGACGACGATGACGTCATAGGAGTGGAGGGGGCTAGGGTGATAGAAGAGAAGTTGACTGATGTCCTGAACGAGACGTGGGTGAAGTACAACTTGGATGACCGGCTGCAGGAGGCACAGGACCAGCTCAAACAACAGCTGGAACAGCTGCAAAAGGACAGTTTGGAAATCAACCAGCTCCTAGACGAGTGCCAGCTATCCACAGACAAGCTGCGATCAGAGCTCTACTCTAGCTTCGAGCGCGACATTAAAGAACTCCCATCGCTTCTAATATTCGACGTGCCTAATTGCTCTTACACCTGCGTCGATCCATCCGGAGAGGGAAGCAGACTACTGAAGCGCAGGAAGTCATCTGTATCCGAGTCCCCGGCAAAGAAATCTGCATTGTCAACAGGCGATCAAGACACAAACACCAAAGACATGACAGACGAGAAAACGGAAGAGGATAATCCTGATGTGTCTGTGGTACATCTCTCCGTGGAATCCGCGCCGCCCGACCTTCCTCCCGCGGGGGAGGTAACCTACCCCCCCTTAAGAGTGGGGATGACGATCTACGCGTCCAAAAATGCCCTGGGTTCCTGGATGAAAGCCAAAATTGTAGAGATCACTCCGAAATCATCACTTCCGAACTGTTTTACGCTGTGTCGCGTCAAGTACGAATACAAACAGTCTAAGCCAACCAAAATATTACCAGCGAGGTGTATCGCCTACATAGACCCACCAGACGTTAGAATGACTATAGGTACCCGTGTGATAGCTCTGTTCAAAGACATAACCATGAAGGAGTCCTTCTACCCGGGGATTGTTGCTGAAATACCGAACCCAGTCAACAATTACCGCTACCTGATATTCTTCGACGATGGCTACTCTCAATACGCGCCGCACTCTAAGGTCCGTCTGGTGTGCGAGTGCGCGTCTCACGTGTGGGAGGAAGTACAGCCCAAGTCGCGGGAATTCGTCCGAAAATATCTCCTGGCTTACCCTGAGAGACCCATGGTGAGGTTGCACCCTGGACAGAGCTTGAAGACGGAATGGAAGGACAACTGGTGGTCATCCGTGGTGGTGTCGGTGGACGCGTCGCTGGTGGAAGTCCAGTTCCTCCAGCTGGACAGACGAGAGTGGATCTACCGAGGATCCACGAGACTCGCCCCCCTGTACCTGGAACTGCAGGCCGCGGAGAGACACAGGCCCAGGGCCCTGCCACGGGCACAGACCACGAGGACGAACATGCCCTACGTGGAGTACACCAGATCTGAAGAACAGACGAGCAAACAGGCCGAGACTTCGCCACAGCAACAACAGAGTGAGTACTACACGCCGAAGAAACAGGTGAAGCCGTACAAGATGGTGCCACACACTTGCTCGCCGGCGTGCAAAAGAACGGATGTTCTGGCACTTAAGGATTTGAGAACTTATAATCCGTTAGCCAAGCCGCTACTGAGCGGCTGGGAGAGGCAGATAGTTCTTTTCAAGGGCAACAAGGTTGTGTTGTACGTGTCTCCGTGTGGTCGCCGCATCCGCTCTCCGCGGGAGCTACATCGCTATCTGCGGACCGTTGGGTCAGACCTGCCAGTCGACCTCTTCGACTTCACACCATCCACGCACTGTCTGGCCGAGTTTGTGCTCAACAAATGCTACGTTGGCAAAAAGGATTTGTCCCATGGCAAAGAGAACGTCCCAGTGCCTTGTGTCAATTACTACGACGAATCACTGCCAGAGTTCTGTTCCTACAACACTGAGCGGACTCCGACCGCTGGGGTTCCACTCAACCTGGACCCGGAGTTCCTGTGTGGCTGTGACTGTGAGGACGACTGCGAGGACAAGAGCAAGTGCGCCTGCTGGCAGCTGACTCTGGAGGGCGCCAGGACGATAGGTCTGGAGGGGGAGAACGTCGGTTACGTTTACAAAAGACTGCCAGAACCACTGCCTAGCGGTATATACGAGTGTAATTCGAGGTGTAAATGTAGAGACACGTGCCTTAACCGCGTCGCTCAACATCCGCTGCAGCTGAAGTTACAAGTGTTCAAGACCCTCAACCGCGGGTGGGGGATTCGCGCCCTCAACGACATACCGAAAGGGGCCTTCCTTTGCGTCTACGCTGGAAATTTGCTCACCGACGCTACAGCAAACCTTGACGGTCTGAACGAGGGTGACGAGTACCTGGCGGAGTTGGACTACATCGAGGTCGTGGAACAGATGAAGGAGGGTTACGAAGAGGACATACCAGAGAACATCAAGAAGATGGATGAGGCGGAAATAGCGAAACAGCAGTTGATGCCGGACGACGAGATGGAATCCTCGTCATCAGAGGAAGGGAGCAGCACCAAGAACGGCGAGGAAGACGATGACTTCAGTCCCGGATACATCGGCCTGGGTGTAGCTAAAGAAAAGTCTATGGCCAAAGACAAGGATAAAACCGAAGCGAGGAAGGAGAACGAAGAGGATTGCATCACCATCAGTGATGATGAGGAAGTTCGAGAACCTTCAAACTTCACGGCCGCTGCTGGGATGGGAGCAAACGAATTTAAATCAAAATATAGGTCTGTCCGTAGTCTGTTTGGTGAAGATGAAGCCTGCTACATCATGGACGCCAAGGTAGCTAAAGAAAAATCTATGGCCAAAGACAAGGATAAAACCGAAGCGAGGAAGGAGAACGAAGAGGATTGCATCACCATCAGTGACGATGAGGAAGGTGGGGAGTCGTGA

Protein sequence:

>DPOGS200462-PA
MASKQNMEDEENLVGKKELDDTTNVNDQKDGEVYESVDDDMELKWEDDDIDDVSITNEDALLEDVAMDNDDKLLPSDSVIPVASQESITYEINPNEFINKANLDETFEPAKAHGLNKPDMMLEILTHNLSDLSDDEDLTNLKMSPDVDLERCSPNKDSQIENKADFMETDLNDDFDRISSLVHENVDDDLDKGADISMCEDSKPAETSLSRKLSVNDDSIDEDILLADDDKDEQDEGGMEELLDDKIDLDAVDILEINSEEKLELESEKNKLLQNIPDTDGLKLNNEEDVSGIKAECGAECKEVDEIINVPSPKPEISEAHVPKVINADQSNDSPTSEKEVQVSKTGIKRKKLSLRLRLDKTQSTGSDVIMDEHTSLKSDGSEVALRDPGNDQINTNNAHLTSEDVANQPDIANKDLITAADLESEHTKRKKDSEDTSSQLPDNDVSLKPKKDLKSKEKKLTPDIEPQPSTSGSKNIKINIESASKSDSRLTSNISKLSSPAEDIPGTTDNLDLLAESSRVTHDDEAEDEYMDDEEGEDFEQFDESSNQMAAEQSEDSEQHHSDNAHETTHSNEKEFSFTITDVVTENVVKVDIENQDSIKSENLESVPIGNVCQNMDVSKVGGIETDFGNENKKLTSEEEKNLDYQDETKDCTDVKKSEALSYVELEESSEEGNHELDALDANKTDEVGNMNTTQDINEDEPAKTEDTGLDDSNTLVEDTTNQDTKALDKDDKTKSQGLEVFNLDSDEEDVGEKNKTDISHQETPENPKPQSQWVKCINKSCANTSSDYYKADGITVNFYDPERKKRGYVCQTCLNLVEERNQLLISGIKSLVPLLKLEPGRPEEDLVEISDSESEDEAEPEDDDDVIGVEGARVIEEKLTDVLNETWVKYNLDDRLQEAQDQLKQQLEQLQKDSLEINQLLDECQLSTDKLRSELYSSFERDIKELPSLLIFDVPNCSYTCVDPSGEGSRLLKRRKSSVSESPAKKSALSTGDQDTNTKDMTDEKTEEDNPDVSVVHLSVESAPPDLPPAGEVTYPPLRVGMTIYASKNALGSWMKAKIVEITPKSSLPNCFTLCRVKYEYKQSKPTKILPARCIAYIDPPDVRMTIGTRVIALFKDITMKESFYPGIVAEIPNPVNNYRYLIFFDDGYSQYAPHSKVRLVCECASHVWEEVQPKSREFVRKYLLAYPERPMVRLHPGQSLKTEWKDNWWSSVVVSVDASLVEVQFLQLDRREWIYRGSTRLAPLYLELQAAERHRPRALPRAQTTRTNMPYVEYTRSEEQTSKQAETSPQQQQSEYYTPKKQVKPYKMVPHTCSPACKRTDVLALKDLRTYNPLAKPLLSGWERQIVLFKGNKVVLYVSPCGRRIRSPRELHRYLRTVGSDLPVDLFDFTPSTHCLAEFVLNKCYVGKKDLSHGKENVPVPCVNYYDESLPEFCSYNTERTPTAGVPLNLDPEFLCGCDCEDDCEDKSKCACWQLTLEGARTIGLEGENVGYVYKRLPEPLPSGIYECNSRCKCRDTCLNRVAQHPLQLKLQVFKTLNRGWGIRALNDIPKGAFLCVYAGNLLTDATANLDGLNEGDEYLAELDYIEVVEQMKEGYEEDIPENIKKMDEAEIAKQQLMPDDEMESSSSEEGSSTKNGEEDDDFSPGYIGLGVAKEKSMAKDKDKTEARKENEEDCITISDDEEVREPSNFTAAAGMGANEFKSKYRSVRSLFGEDEACYIMDAKVAKEKSMAKDKDKTEARKENEEDCITISDDEEGGES-