Monarch geneset OGS2.0

DPOGS200577
Transcript: DPOGS200577-TA, 5739 bp
Protein: DPOGS200577-PA, 1912 aa
Genomic position: DPSCF300303 + 133639-144525
RNAseq coverage: 91x (Rank: top 63%)
Annotation
Heliconius: HMEL016949, 0.0, 58.29%
Bombyx: BGIBMGA002246-TA, 0.0, 63.81%
Drosophila: Mes-4-PA, 4e-150, 46.55%
EBI UniRef50: UniRef50_D6WZP0, 0.0, 47.13%, Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WZP0_TRICA
NCBI RefSeq: XP_973711.1, 0.0, 47.13%, PREDICTED: similar to NSD1 [Tribolium castaneum]
NCBI nr blastp: gi|270014006, 0.0, 47.13%, hypothetical protein TcasGA2_TC012700 [Tribolium castaneum]
NCBI nr blastx: gi|270014006, 0.0, 42.54%, hypothetical protein TcasGA2_TC012700 [Tribolium castaneum]
Group
Gene Ontology: GO:0005515, 4.8e-40, protein binding
  GO:0005634, 1.3e-13, nucleus
  GO:0018024, 1.3e-13, histone-lysine N-methyltransferase activity
  GO:0008270, 1.1e-07, zinc ion binding
KEGG pathway: tca:662527, 0.0
  K11424 (NSD1_2) maps-> Lysine degradation
InterPro domain: [1663-1786] IPR001214, 4.8e-40, SET domain
  [1474-1536] IPR000313, 5.5e-15, PWWP
  [1612-1662] IPR006560, 1.3e-13, AWS
  [1412-1487] IPR011011, 4.5e-11, Zinc finger, FYVE/PHD-type
  [1424-1473] IPR013083, 5.5e-08, Zinc finger, RING/FYVE/PHD-type
  [1429-1469] IPR001965, 1.1e-07, Zinc finger, PHD-type
Orthology group: MCL10357, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200577-TA
ATGGATGAATCTGAGGACAATTTGAGTGCTGAGGAGATAAAACTTGAAGATGGTTCATCTGTTAAAGTTAAATCAAAAAAGCGGTCTCTTGTTGAGAGGGAGCTTGAAACAAATTTAACAGCCAGGGTTACCTCTCCGATCATGAATAGAGATGCCTCAAGCACCAGTCGCTATGGGAGGGCTCGTCGGTTGAAAACCGAGGCAGATTTCTGTGATACTGATAAAGCTGTGACGAAATGCTTAAAATCACCAAAACGTGAAGTAACAAAGTCACCGAATAAGATTCAATCGCCAGCCTATAAAATGCATGCCTCAAACTCTCCTATAAGAGTTGAGACACCAAAAAAAGAATCTTTGAATAATCAAATCGAAAGCATTTACAGCGAAAACATTTCATTGAGCCGTTTCAGACCTGAAGAAAAAAAATCAGCTGCAAAAAAGTTTCCCAAAGTATATATCAGGAAAGATCTCATACAGACCAAGGAGAAAGAAATCGATGACACTGTTGTATTGATAAAAAATATATTCTCACCGACCAACAGTATAACAAAAACAAAGCCAAATTTCAATACAGAATCGTCAGCTGAGAAAAATTATTCAAATAAGATGAACAATTATATGAATACATTGTCCGTTGTTAAGACATTGGATTTTGATGGTAATAGAAAGAAAAAGCGGGAAGACAGGACATCGTTATCTAAGAGTGAGCTTTTTGATTTGGAGGCGCAATCCGAATATCAAGTAGGGGATCTGGCCTGGGCTAGGATGGGGAGCTACCCATTTTGGCCGTGCATCATAACCAGAGACCCGCATAGTGGAATGTTTGTTAAAAAAAAGTTATTCGGTCGTATCGAGCGCGACGTTATGCATGTTACCTTCTTCGGTGATAATGGCCGCCGCGGCTGGATCGTGGACAGTATGCTCAGAAAATTCCTAGGCCAGCTTGAATTTGAAGCGGCAAGGATGAACTTTAGCACTGAGGCAAAGAAAAAAGACCCAAGATTGTTTGCTGCATTTTTTATATCAGAAAAGAAGACACCACAATGGCAGATATCGGTGGAAGAGGCAGAGGCATTGCTTAGGGAACCGAAACGATTAAGAATTGACATACTAAATGATATGATTGAAAAGTCCAGGGCCTTGAAAACTACACCCAAACCAGAAAAAGGCAGAAAAATTACTAGAACAAACAGCGATGTTTCGTTGAGTGAAAGTTTGTATGACACATTATTTTCCGAAGATGACAGCAAGGTTAAAGAAACAGAAAGATCTAGAAGTAAAAGTAGAAATAAGTCATTGGATGTCTCCGAGGTCGTAACAGCCTGTTTGGATAACATGGCCGCCAAAACCGGTATAACAAGAATACAGAGGCAGTCACATATGGACAGATGGTTACAGAAGGCTAAATCCAAAACTCCAGAGAAATCCCATCATAAAACCCAAATCAGAGAAAGCAGTAGTTCAAAACCAAAGAAAAAAAATAAAGACAAGAATAATAAAATCACGTCAGATAACAAACACATCGACCATGATTACAGCCGAGATTATAGTCCGAAAGATAGTCCAATCGTCATCGAGGATGAAATAGAATGTAACGCTGGAGTATTCGATATACAGCCTGGTTCAGGAATCGGCATTATATCAAAGGTTGAGACGCTCTGTGGTAAAGACAAACAAGGTCACGAAAAAGGTCGCATTATAAATGAGGTAGCACAAGAAAATGCTGGTGCACAGAATACAGATGTTGAACAAGTAAATTGCGTCGAATCAGTTCTATGTGGTACTGAACATAACAATGAGATATCCCCAAAAGTGGTATCCCAGATCAGTACCGACGTCGAGAGTGACGTTGAACAGATAAATTCCGTTGAATCACCAGAAATATCATCAATACAATCTGAATATAACGATGATGATAAAAATGAACCCGTACTGTTGTGTGATCAAAACGACATCGAAGTGAGCACTCTGAACATTAATGGCATCAAAAATTCACATTC
AATATACACTAAGATACCGGATGTTACGGATAATAATATTTTGGATAATATATTCGCCACGGATGACAGCTCTAGTGATGTGGACACGAACAATGAAAGCGTCGATGATAAATCGACCAACAATGAAATCAAGGAAGCATCGATAGATTTAAATAACGTGAATGAGGAGGGTAAAGAAGTAGAAGATAAAGATACAGAAGAGAGAGAAGAGAGCAGTCCGTCTAGAGATGTAACAGGTCAAAATGCATCACCGATTAATGATCGTGGCGAGAATTTAAATAAGATTACTAACAGTGTTGAAAGCTCATTAGACTTAGATCAGAATACAAACGCTATAATTAACAATGACTCACTAAACAAAGAAAATCAAGACGCGAAAGATCAAGAAGCAATGAATGTTAACAATATGGAATTATGTGAAAAGGATAATATAAATCAAGAATTGGATAGTGTTGTAGAAAATATTTCACCAGACATAGAAACTATAAGACGTCGAAAGCGTTGTTTGAATGTGCCTTTAAACAAATCTATCGATATTGTTACGAAAACAAGTGAAGATAAAACGAATATCGACAAAGAAGAGAATTGTGACAATAATTTAAATAAAACTGCCGATGAATTACCAAAGACTGAATTAACTATTAATACTAATACAAACAGCTCGGATGAGGAAGCTGAGGGTAAAAATCCCATCATAACAGATGAGCTATCAGATCAAAATGATGGGAAAGCAGATGAGATTAAAGAAATGGAGATGGAAAAAGATAATGATAATGTATCAGTTGTATCAGAGGGCAGCGATATATCAAGAAAGAAACGCGCCAGAGACAAGCCGTCTGATAAGAAATCTTTGTTATCTGATGTAGAATTCCTGAAATATCTGGAATTGAGACAGGATGCGGTCATAGACGAGCATCCCGAGCTCTCGCAGGAAGACATCACTAGCTATTTATACAAAACCTGGATATACGAGGAAAATTTGAAACCAGATATAAAGAAATGCGATGACATAGACCAAGCTAATTTAGTGAAGGGTTTGAACTTAGACCCAGCGCCGGTCAAAAAAGTCAGGAAGAGGGTTAAAGTTGACAAAGAGATTGCGTGCGAGGACACCGCTACCAAAGAGAAATCTAAAAGAAAAATAATACGCCCATACTATAAAGAGGAATTTTCAGACGGGGATGATAGTGTGGAATATTTTGATATATTTAAATCTAAAAAGGACCAAAAAGGACCAATTGTGGACAGTAAAGAAATATATCAGAGCGACGGAACTGTCCTCGAACGTATTATAAACGTAGACGAGTACGTTCAGGACGAATACGATGACGTCGAAGAATACTTCAGACAGCTAACAGCGCCAAAACCTAACGTCTTTAAGGGTTACGCGAGGGAAAAGGTGTGCGAAATATGTGAGAAAGTCGGCGGCTTAGTCAAGTGCAAGGGTTGCCATTCAATGTTTCATGTGGAATGTGACAAGAAGGAAATCGAGGTTATAGAATGCCAGACGCCAACAAGAGGCAGGAGGAGGAAGAAGAAAACTAGAGGAAGGAAGACCAGGGACGATCACAACCAAGACTCCGGCAGCGACGAGAAGTCGCAAGACACCAACGGCTCGGACGAATTACATATGTCGCTGGAAGAAGAATCTCATATAATAGCAAATGCGGACGATTTTGAGGCTCAAATGTCCGTAAGAATGCAAGAAATACTCAAGGATCAGGACATTCAGTACGATTTCTATTCACGCGAGGAGTTGGATTGGAACGACACTCACGCGGGCGAATGTAAGGTCGTGGACATAAAGCCGAGAATGGATTCCATAGAAATAACGGATTATTCGGAATTCAAATGCAAGAACTGCCAGAAATACGATCCGCCGGTATGTTTCGTGTGTAAATATCCTATATCGCCCAAAGAGAAACAGGGTCACAGGCAGAAATGTCAAGTGGCTCATTGCAATAAGTATTACCACTTGGAATGCTTGGACCATTGGCCCC
AAACACAATTCAACGGGGGAGAAATTTCTAGAACGAATAAGTTCAGCGAAGCCCTAACTTGCCCGAGGCACGTGTGCCACACTTGTGTCTGTGACGATCCCAGGGGTTGTAAGACGAGATTCAGCGGTGATAAATTAGCGAGATGCGTTCGCTGTCCGGCCACTTACCACACATTCACGAAATGTCTACCGGCTGGGTCACAGATACTGACCGCCTCCCATATAATATGTCCACGACATTATGAACACAGGCCTGGCAAAGTCCCCTGCCACGTGAACACCGGCTGGTGTTTCATATGCGCCCTGGGCGGATCTCTGATATGTTGTGAATACTGCCCGACGTCCTTTCACGCTGAGTGCCTTAATATTAAACCTCCCGAGGGTGGTTATATGTGCGAGGACTGTGAGACTGGTAGACTACCGCTGTACGGAGAAATGGTCTGGGTGAAGCTAGGACACTACAGGTGGTGGCCAGGTATAATTCTTCATCCGTCTGAGATTCCAGACAACATCCTAACCGTGAAACATACCCTCGGTGAATTTGTGGTCAGATTTTTTGGACAATACGACTACTACTGGGTCAATAGAGGCAGAGTGTTCCCGTTCCAAGAAGGTGATTCGGGTAAAGTTTCTAGTCAGAAATCCAAGATAGATGCAGCATTCACTATGGCGATGGAGCACGCACAAAGAGCTTGTTCGATTTTGAAAATGGCTGCGCCGAATGAAGAAGAGTCTTCTGACATAGCATCTTCATTGTTACCACCTCATTATGTTAAATTGAAGGTGAATAAACCTTGCGGGTCACTCTGCGGCAAGAAAATAGATTTAGAGGAAAGTTCATTGACCCAGTGCGAATGTGACCCTAATGATGTCGATCCTTGCGGTCCCTATACTCAATGTCTCAATAGAATGCTTCTAACTGAGTGCGGTCCGACGTGTCGCGCCGGAGATCGCTGTAACAACAGAGCGTTCGAGAAACGTCTTTACCCCAGGCTGGGACCCTACCGCACCCCGCATAGAGGCTGGGGGCTACGGACCATGCAGGATTTAAGAGCTGGCCAGTTCGTTATAGAGTATGTGGGGGAGCTGATAGACGAGGAGGAGTTCAGACGTCGCATGAACAGGAAACACGAGGTCCGGGATGAGAACTTCTATTTTTTAACGTTGGACAAAGAGCGCATGATAGACGCCGGGCCGAAAGGGAATCTGGCGAGGTTTATGAATCATTCCTGTGAGCCTAATTGCGAAACACAAAAGTGGACGGTGTTGGGCGACGTGCGTGTGGGATTGTTCGCGTTACGTGACATACCGGCAAACAGCGAGCTCACATTCAACTATAACCTGGAGACGTCGGGTATTGAGAAGAAAAGATGTATGTGTGGAGCCAAGAGGTGTTCAGGATATATAGGGGCTAAGCCTAAACAGGAGGACCAACCAAAGAAAATCAAGCCGCAGGTGAAAAGGATTTACAGGAAGCGCAAAGCGGAAGAATCGCCGTCTACGAGCCAGTACAAGAAACGAGGCAGACCCATAAAACCGCGAGAGCTGACCGAAATAGAAAAAGATCTTTTAATCATCAAAAATGCGACCAACGGCCTGTCTAGCGATTCAGAGTGCTCCAGGATAAGCATGGACAGCTGCAAAGATATAAAGGCGCTCAAAAGGAAAAGAATCAACCTGTCCACCGAGGAGTTGTCCCCGAAGAGGTCTAAGACGGATGAAATGAATTTGGTTTATTGA

Protein sequence:

>DPOGS200577-PA
MDESEDNLSAEEIKLEDGSSVKVKSKKRSLVERELETNLTARVTSPIMNRDASSTSRYGRARRLKTEADFCDTDKAVTKCLKSPKREVTKSPNKIQSPAYKMHASNSPIRVETPKKESLNNQIESIYSENISLSRFRPEEKKSAAKKFPKVYIRKDLIQTKEKEIDDTVVLIKNIFSPTNSITKTKPNFNTESSAEKNYSNKMNNYMNTLSVVKTLDFDGNRKKKREDRTSLSKSELFDLEAQSEYQVGDLAWARMGSYPFWPCIITRDPHSGMFVKKKLFGRIERDVMHVTFFGDNGRRGWIVDSMLRKFLGQLEFEAARMNFSTEAKKKDPRLFAAFFISEKKTPQWQISVEEAEALLREPKRLRIDILNDMIEKSRALKTTPKPEKGRKITRTNSDVSLSESLYDTLFSEDDSKVKETERSRSKSRNKSLDVSEVVTACLDNMAAKTGITRIQRQSHMDRWLQKAKSKTPEKSHHKTQIRESSSSKPKKKNKDKNNKITSDNKHIDHDYSRDYSPKDSPIVIEDEIECNAGVFDIQPGSGIGIISKVETLCGKDKQGHEKGRIINEVAQENAGAQNTDVEQVNCVESVLCGTEHNNEISPKVVSQISTDVESDVEQINSVESPEISSIQSEYNDDDKNEPVLLCDQNDIEVSTLNINGIKNSHSIYTKIPDVTDNNILDNIFATDDSSSDVDTNNESVDDKSTNNEIKEASIDLNNVNEEGKEVEDKDTEEREESSPSRDVTGQNASPINDRGENLNKITNSVESSLDLDQNTNAIINNDSLNKENQDAKDQEAMNVNNMELCEKDNINQELDSVVENISPDIETIRRRKRCLNVPLNKSIDIVTKTSEDKTNIDKEENCDNNLNKTADELPKTELTINTNTNSSDEEAEGKNPIITDELSDQNDGKADEIKEMEMEKDNDNVSVVSEGSDISRKKRARDKPSDKKSLLSDVEFLKYLELRQDAVIDEHPELSQEDITSYLYKTWIYEENLKPDIKKCDDIDQANLVKGLNLDPAPVKKVRKRVKVDKEIACEDTATKEKSKRKIIRPYYKEEFSDGDDSVEYFDIFKSKKDQKGPIVDSKEIYQSDGTVLERIINVDEYVQDEYDDVEEYFRQLTAPKPNVFKGYAREKVCEICEKVGGLVKCKGCHSMFHVECDKKEIEVIECQTPTRGRRRKKKTRGRKTRDDHNQDSGSDEKSQDTNGSDELHMSLEEESHIIANADDFEAQMSVRMQEILKDQDIQYDFYSREELDWNDTHAGECKVVDIKPRMDSIEITDYSEFKCKNCQKYDPPVCFVCKYPISPKEKQGHRQKCQVAHCNKYYHLECLDHWPQTQFNGGEISRTNKFSEALTCPRHVCHTCVCDDPRGCKTRFSGDKLARCVRCPATYHTFTKCLPAGSQILTASHIICPRHYEHRPGKVPCHVNTGWCFICALGGSLICCEYCPTSFHAECLNIKPPEGGYMCEDCETGRLPLYGEMVWVKLGHYRWWPGIILHPSEIPDNILTVKHTLGEFVVRFFGQYDYYWVNRGRVFPFQEGDSGKVSSQKSKIDAAFTMAMEHAQRACSILKMAAPNEEESSDIASSLLPPHYVKLKVNKPCGSLCGKKIDLEESSLTQCECDPNDVDPCGPYTQCLNRMLLTECGPTCRAGDRCNNRAFEKRLYPRLGPYRTPHRGWGLRTMQDLRAGQFVIEYVGELIDEEEFRRRMNRKHEVRDENFYFLTLDKERMIDAGPKGNLARFMNHSCEPNCETQKWTVLGDVRVGLFALRDIPANSELTFNYNLETSGIEKKRCMCGAKRCSGYIGAKPKQEDQPKKIKPQVKRIYRKRKAEESPSTSQYKKRGRPIKPRELTEIEKDLLIIKNATNGLSSDSECSRISMDSCKDIKALKRKRINLSTEELSPKRSKTDEMNLVY-
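
The transcript and protein entries in this record can be cross-checked against each other: 5739 bp corresponds to 1912 codons plus one stop codon. The following is a minimal sketch of such a check, assuming the standard genetic code applies and that the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `CODON_TABLE` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1 using the standard code.

    Whitespace and line breaks are removed first, so a sequence wrapped
    over several lines can be pasted in directly. Stop codons are written
    as `stop_symbol` (assumed here to match the record's "-" convention).
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Length check from the record: 1912 residues plus one stop codon.
transcript_bp = 5739
protein_aa = 1912
lengths_agree = transcript_bp == 3 * (protein_aa + 1)

# First six and last three codons copied from the DPOGS200577-TA sequence.
first_residues = translate("ATGGATGAATCTGAGGAC")
last_residues = translate("GTTTATTGA")
```

Passing the full DPOGS200577-TA sequence to `translate` and comparing the result with DPOGS200577-PA would extend the same check to every residue.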