Monarch geneset OGS2.0

DPOGS210324
TranscriptDPOGS210324-TA6348 bp
ProteinDPOGS210324-PA2115 aa
Genomic positionDPSCF300025 - 698876-752054
RNAseq coverage21x (Rank: top 79%)
Annotation
HeliconiusHMEL0089070.085.70% 
BombyxBGIBMGA011962-TA0.078.45% 
Drosophilawry-PB4e-1634.86% 
EBI UniRef50UniRef50_D6WF180.041.78%Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6WF18_TRICA
NCBI RefSeqXP_001809017.10.042.01%PREDICTED: similar to notch homolog 5 [Tribolium castaneum]
NCBI nr blastpgi|1892350920.042.01%PREDICTED: similar to notch homolog 5 [Tribolium castaneum]
NCBI nr blastxgi|1892350920.042.20%PREDICTED: similar to notch homolog 5 [Tribolium castaneum]
Group
Gene OntologyGO:00054887.8e-35binding
GO:00055156.1e-05protein binding
KEGG pathwayame:4103513e-14 
 K06051 (DLL)maps-> Notch signaling pathway
InterPro domain[94-239] IPR0161877.8e-35C-type lectin fold
[93-240] IPR0161864.2e-30C-type lectin-like
[94-237] IPR0013041.9e-25C-type lectin
[568-678] IPR0008592.9e-25CUB
Orthology groupMCL18383 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210324-TA
ATGGGAACTGGTCCGATGGTTAAAAAGAATTCAGACAATTTTAATATTATTCATGAAGACGGTCATAGCTTTTGCAAAGGTATGGAGTTGAACCTAGTAGATGGTGATTCAAGATGGAAACATTGCTCACAGCCGGGGGATCCTTTGCGAAGTGTGCAAATAATTTCAGAAAGAAATTCAGTCAAGCTTAATATAAGTATTTTAGCAAAGAAAAATTCATCCGCAATGTGGTTAAAAGTATGGTGGATGGATAAACCTATCGAGGAAGTTATAGGACAATGTGATTTTGGTTGGGTGGTGTCCGGAGATTTTTGTGTTACCTCTGTGAGGGAAACAAAGAGTTCGTGGCGACAAGCCGAGCTCGAGTGTGTTCGACTTGGGGGTCACCTGGCAAGCATCCTTAACGAACGTCAGCAACAAATTATCGACCAACTACTTATTCACACACCAGGAGCCGGCGTCGATGACGTCTATTGGATAGGTGCCACCGACTCCGTCCACGAAGGAGAATTCCGTTGGTCGGATGGACTACCTTTTTCATATGCACACTGGTTTCCCGGTTGGCGTAAACACGCTGGCCAACCAAACGACGACGGAACCTCAGGGCAGGACTGTGTGGAGGTACGACGAGAACTGCCCCCCAGACCAGCTCATCCAACCTTCATGTGGAACGATAGAAGCTGCAGGGAGAGGAACTACTACGTTTGCGAGAGACCAGGCGTTGAAGATCCGTACAAGGCTTCTAAAGTCCAGTGCAACGAGTCGATAGTGCTGTCACACTTGCACCCGCAGGCTACAATCTCGAGCCCCGGCTTCCCTCGCCCCTATCCCGACGACGTGCATTGTGTTACACAGATCAAAGCTCCGCCAGCCCACACTATACGTTTACATTTTGAGGAACTGCTCACCGAACATGAACCTCATTGCAGTTACGACTTCCTGGATATAATAGAGTCCGGCCTGGAAAATATCACAGAATACGGTCCAGTGGAATGGATACCCCATGATCACCACTTTGGCTCCGAAAATGAGGTTTGGGAGGAATTAGAAACTCTTTTACCGGAATCGGATGCGGTGAGTGCGGGCTGGTCACGAGACACTCACAGCACACGAGCGAGGCGCCTCTGCGGCGATTGGAGCGGGAAGCTTAAGCTGCTGCGCTACCAGTCTCGTGGCCACACACTGCGGCTTCGATTTAAATCGGACCACTCTAGACACTTTGCGGGCTATCGAGCGAAAGTAACTCTGACTGACACACAGTCGTGTATGGACGTGAAACAGGTACTGTTCAACGGTTACTGCTACCTTTTCTCCGGCTACCCACAGGCTTCTTGGTCAACCGCGAAACAGGTTTGTGAAGGTCTCAACATGCATCTGTCCTCGATACATACAGCGGAAGAGGAGCGCTTTATAGTGACCGGTATCAGACAATCCAGCGATTACAGCGCTGGATCCGTATACTGGCTCGGTTCCCGTCTAGACGATAATGCAATAAGTTGGATCGATGGAAGCACTTTAGATTACCAAGCCTGGCCTCCTTATAATGACACCGAGGAAGTCGAAGACAGCTGTTTAGGTGTTCAGTGGAAAACCTCTCCAGTACCCTCACAGCCATCGGGACTGTACTGGACTCCATACAAGTGCTCGGCCACTGGTGGCTACGTGTGCAGGAGACGCCTCACATCAGAGCACGTCCTCAGAAATACTACCGTCGAGGGCACTTCGGGTACATTAAGGAGTCCTAACTATCCCGGTCTGTATGACAACGATCTGGATTATTGGGTTCATGTCAGAAGTGCCCCGGACACACGCCTAGTATTCGTTTTCACATCCATCAACTTGGAATATCAAAACGACTGCCTTTATGACTTTATAGAGTTACGAGATCGTAAAGTCAGTTCGAAGTCATCGAGATACTGCGGTTCGGTCGGAGAGACGAGATGGGTCGCCGCCACCAACGAAGCGATACTTCACTTTCATTCTGACTATAATACCCAGGGTGCAGGTTTCTCAGTGAACTGGTGGGCGGTCGAGCTGGCGGGATGCCCATCTCAGACGTTCACGTCCAAGGAAGGAATCATTCACAGTCCTAATTACCCCCACTTTTTACTACCCGATATGGATTGTACAATCGACATATTTGCTCCAGCGGGAAAAAGGGTGTACTTAAATATAAGTTTTTTTGATTTCGGTTATGGACAATTCGAGAACGGAATTCCGAACAACGTGTCGGATGTCATATCCGAAGACAATTATTTAGAAATTCAAGTCGACTCTCAAAGTCGACCTATAAGACCATTCCAAAATTCAAAAATTTTAACAAACGGACTGTTCGTGTCGCAATCTGAAATCATGAGAATACGACTGAAAACCGGAGAGAACGTTACCGGGATCGGATTTCTTGCTCATTTCAAGACTGTGTGGCATTTAAATGCTTCCATCACGATCAGTTTATCGAACGCCGGTCGTCTGGCTTCTATAAACTATCCAGAGATGGGTCCGTCCCGGAGTACGCTCCGCATGCGACTGGTGGCTCCCCACGGGCACACGCTCGCAGTAGCGTTCAGTTCCACTGCCCTTGTACCAGCTGGTGAATACCCATGTGGAAGGGAGGCAGGCTGGATCGAGGTGGTTGATAGTTACACGGACAATAATGGCACACAATGGACTCTTTGCGAGGCGAACCTTCGAAAAAGAGCCGTCGAGAGTGCTCCACTTGTCATCACCTCATATTTGCACTCATTGATAGTGACGCATCACTCTGGTGAAGAACCTATGGGACTAGACGTAGCTGTAGTAGTAAATATTGATACAGAATATCACAATAAGGTTCTCCTACTACCGGATGAAACTAATTTAGAGTCCTGTTACCCGAACCCGTGTCTGCACGGTGGACAATGCGCTTTTGAAGATTCCAGAAACATGTGTCAATGCTCAGGATATTATACAGGTGTTTTCTGTCTCCTGACTGCATGCGAGCGATCCCCCTGTGTGAACGGTAACTGTTCGCTGGCGGCGGACGGTGCGCTGTGTGCCTGTGCTCGTGGCTGGAGAGGCCGGCGCTGTGCCGAGCGAGTGAGACCTTGCGCAGCGCGGCCCTGCAACCACCGGGGGGCTTGCGTTGAAAGGGACGCCGGCTTCTTGTGTCAGTGTAACCCTTCATGGAAAGGAAAAAGATGTGAAATACCTAATCCTACGCCAAATATCGTAGGTTTGGGTACAAGAATGATGCAAGAACCATTTTGGCTCGGTTTATTCGCTGTTTTCGTCGTCCTCGGTTTTATTGGCCTAATTTGGTGTGGAAAAAGACATTTCCCTGAAAAGATAGAAAAACTCCTAGCTGAAGAAGCCGACAGATGCGCGAGACGGGTGGCGGGTCCAGGGCCGGGGCCGGAGGCGCGCACACTGCTCACGCGGCTGGGTATCCGCAAGCCATCGATGACGGGGGCTCACCCGCGCGCGCGCACCTTTAGTTTGGACGATCTGTTAAAGCCGCCACACGGACGATCTCCTTCGCCTCGTAAAAAACGCAACAATTCAACACCAACAAAGAAAAACGCAGCGGAGAAAAAACAGATATTACAGCAACTCATAAGCCCGGCGCCAGCTAATGATCACTCGAAGAAAATAAGTATGGGGGAATTAATACAGATGTCAGAGAAGAGAACACTGAGTACAGTAGAGAGTGCCCCTGCGTTTGTAGAATATGGATTAGACATTAAAGACACGTCATTCGCTAGTGAAGCCTCGTCACCCTCCACGTCACCGTCAATGCGTCAAGTATCTGACCCTAAACTGGAAAAGAAAGTGACGTTTGCAAGATTATTGAACAAAGTTTCAGCCGAAATGAGCTCGAGTTCAGATGTCGATATGGTTAATACTATTGCAATTCCGATGGCCGTCATAGCTGATAGAACTATAAAGACAAAAGCCTCGAGCACTCCACCCTCACCTGGTGTAGAAGTAAGGTCCCCCCATAGTACGTCAAGTAATCAAGGAAGCGACTCCCTGTCCAGTCTTGATTTGACCTTGGCTAACGGTGCTATAAAGAAATTTTCTAGAGCACCAAAAATATCAAGCGCTGATTCCATACTCGCAATGTTCCGAAACTTTTCTTCGTCAGCTGCTATAGTTTCTCGCGCTTCCTCTGGAGCTATATCCGCATCCAGCACACCCACAGCATCATCACCTCAAGACGACACCGTCGATGGCGATGATCTGTCAATAGCCTCTTCACATATACCTTCTTTAGCACCCGATTCTCCCATTTCAAGACCACACACAACTATCGAAATACAGGTTGTTGACCCACTAAGCTCTCATAAATCTTCTACATCTGGTAATCTCCTACACCCTCCATCCATCCTACTTGAAGTACCCAGTAGCATCAATAAGTGTCTGTCTCCTATTCGAGAACTCCCAACACCACTGCCAACTCCGCTCCCTACACCACTGCCAACCCCACTACCATCACCACGAATGCCACGGGTCAAACTGGAACAGGATATCAAAAAAGATTCGGTAATTTCTTGCATATCACCAAGCGAAGACGAATTAGAGGAGATAAGATATGAGAAAGCTGAAAGACCAACAATGGGACTGAGATTAAAGCTTCGACAGCCGGCAGTGTGTTCTGAAACCCCAACGCCGTCCACTCACGTCACGGACTCACCTAGCCCTAGTTTTACACACAGTCAGGACTCTGAGAGAGAAATGTCACCTCTCTCCCCCGCCCCGCCCTCGCTGAGAGTTCCCGTCCTCACTATTGAACGACCCTCACCTGGATCCCCTCCACCGAGAAGAACCCCACCACATTTAGACTATCAACCACCGCCATTAATAACAGTCACATATAACCCCAGTGAAGAATCAGATGAACCTATGTCACCAAGACCACCTCCGCCAACGGCAAATATGTGCTATCTGAGCCCCTTTTCGATGTCGGCAAGAGGAGAGAGAGCACCATCAGAATCTAATTTGTCATCATCGGGATACAGTTCAATGGCAAGTCCTGGCCCGTCGAGATGTGGGTCTAGTAATCCCTTGTGTCCATCTGAAATGGAAGATCCTGGATCAGGAGGAGGTCCATCTTGTTTTCAATCTAGGCGGAGACCACTCATGAAGACAAATTCAAGTCCAGCGGGCTCTAATGATGGAGGCAATGAGAGAAGAAGAGGTAGGTCTGATTCTGAAACACTTTCAGACGATCCATTGTTAGAATCCAACGACGAAGGAATCGGTACGGATGAAAGAGTAGACGATGTTCCTTCAAGTGCAAAAGAAATGGAAACTTTGACAGTGTTAAAAGAATGCTTAGATATACCACAAACAACTTTATGCTCCCCGAGCGGTGTCACTAAATGTACCATCGTAAAGTGCATAAGTGTTGAACGAGGTTTAGATGAAAAGGCGAGTCTTAAACCGCCAATTTTGTTCTCAGATTGTAGTAGACCATTGAGCCCTGTCAGTTCAAGAAGTGAGAGTCCTTTAAGTGACAAAACTGGTTTAGGTAGATTCTCTCCACAATTTTACGGTAGACAACTACCTTTTACTGATTCCGATGGACTTTATGATTTTCCAAGTTCCGAATGCGTTAAAGGCGGTAGTTGTAAGAGTGGGAGCGCATCTCACAGAAAAGCCGGTAGAAGAAGAGACAGAAAAACGACCAGGACAACGTCACATGAGCCCACAGGAACTACTAAATCTACTCTACCACACATGCCGCACTCCATGCATAATTTATTAGAGGTGCCGTATGGCAATCGTGGTCGCAAAGGAGGAAGAAGGAGATCGAGGTCTCAGGCGCCAGCTCTAGCTACCTCATCATCATCTGAAGAGTCTGTGTCCAACGCATCCGTGGCCTCTGTGGCATCAGCTCGAGAACTCAGACTACCCGACTTGGAAATGCAGTATGTCTGTTCACAGCCTGAGCCTGTCAGAACTAAGAAGCCGTTGAAGCGTCAAAAATGCCGAAGCTCTGAGGACACGTCGTCTAAAATATCATCAAGCTTGGACCTAACCGAGGACTCGAAGAAACCGAATAAAATTAGCAAACTCCGGTCCATCGGAAATCAAATAAGATTTCTCCGTCGGTTAGAAAAAAGTTTAAAAATGAAAGAAAGCTACCCGGCGATCTCAGACGACGAAGGAGACGAGTCGTCCAGCGTGACGTCACCCTTGTTACAAGGTAGGAAGGATCTGAACCGTATGACGGGCCACGCGATCTCCGCTCCTCTGCTGGGGGCTGCAAGACCAAAGATCTCGCGACAGAGACGGTACGAGAGATCCCTCCTTGGCGAAGACACGAGGACTTTGAGTACTGCGCCTGGCTACGATAACTCCGATTGA

Protein sequence:

>DPOGS210324-PA
MGTGPMVKKNSDNFNIIHEDGHSFCKGMELNLVDGDSRWKHCSQPGDPLRSVQIISERNSVKLNISILAKKNSSAMWLKVWWMDKPIEEVIGQCDFGWVVSGDFCVTSVRETKSSWRQAELECVRLGGHLASILNERQQQIIDQLLIHTPGAGVDDVYWIGATDSVHEGEFRWSDGLPFSYAHWFPGWRKHAGQPNDDGTSGQDCVEVRRELPPRPAHPTFMWNDRSCRERNYYVCERPGVEDPYKASKVQCNESIVLSHLHPQATISSPGFPRPYPDDVHCVTQIKAPPAHTIRLHFEELLTEHEPHCSYDFLDIIESGLENITEYGPVEWIPHDHHFGSENEVWEELETLLPESDAVSAGWSRDTHSTRARRLCGDWSGKLKLLRYQSRGHTLRLRFKSDHSRHFAGYRAKVTLTDTQSCMDVKQVLFNGYCYLFSGYPQASWSTAKQVCEGLNMHLSSIHTAEEERFIVTGIRQSSDYSAGSVYWLGSRLDDNAISWIDGSTLDYQAWPPYNDTEEVEDSCLGVQWKTSPVPSQPSGLYWTPYKCSATGGYVCRRRLTSEHVLRNTTVEGTSGTLRSPNYPGLYDNDLDYWVHVRSAPDTRLVFVFTSINLEYQNDCLYDFIELRDRKVSSKSSRYCGSVGETRWVAATNEAILHFHSDYNTQGAGFSVNWWAVELAGCPSQTFTSKEGIIHSPNYPHFLLPDMDCTIDIFAPAGKRVYLNISFFDFGYGQFENGIPNNVSDVISEDNYLEIQVDSQSRPIRPFQNSKILTNGLFVSQSEIMRIRLKTGENVTGIGFLAHFKTVWHLNASITISLSNAGRLASINYPEMGPSRSTLRMRLVAPHGHTLAVAFSSTALVPAGEYPCGREAGWIEVVDSYTDNNGTQWTLCEANLRKRAVESAPLVITSYLHSLIVTHHSGEEPMGLDVAVVVNIDTEYHNKVLLLPDETNLESCYPNPCLHGGQCAFEDSRNMCQCSGYYTGVFCLLTACERSPCVNGNCSLAADGALCACARGWRGRRCAERVRPCAARPCNHRGACVERDAGFLCQCNPSWKGKRCEIPNPTPNIVGLGTRMMQEPFWLGLFAVFVVLGFIGLIWCGKRHFPEKIEKLLAEEADRCARRVAGPGPGPEARTLLTRLGIRKPSMTGAHPRARTFSLDDLLKPPHGRSPSPRKKRNNSTPTKKNAAEKKQILQQLISPAPANDHSKKISMGELIQMSEKRTLSTVESAPAFVEYGLDIKDTSFASEASSPSTSPSMRQVSDPKLEKKVTFARLLNKVSAEMSSSSDVDMVNTIAIPMAVIADRTIKTKASSTPPSPGVEVRSPHSTSSNQGSDSLSSLDLTLANGAIKKFSRAPKISSADSILAMFRNFSSSAAIVSRASSGAISASSTPTASSPQDDTVDGDDLSIASSHIPSLAPDSPISRPHTTIEIQVVDPLSSHKSSTSGNLLHPPSILLEVPSSINKCLSPIRELPTPLPTPLPTPLPTPLPSPRMPRVKLEQDIKKDSVISCISPSEDELEEIRYEKAERPTMGLRLKLRQPAVCSETPTPSTHVTDSPSPSFTHSQDSEREMSPLSPAPPSLRVPVLTIERPSPGSPPPRRTPPHLDYQPPPLITVTYNPSEESDEPMSPRPPPPTANMCYLSPFSMSARGERAPSESNLSSSGYSSMASPGPSRCGSSNPLCPSEMEDPGSGGGPSCFQSRRRPLMKTNSSPAGSNDGGNERRRGRSDSETLSDDPLLESNDEGIGTDERVDDVPSSAKEMETLTVLKECLDIPQTTLCSPSGVTKCTIVKCISVERGLDEKASLKPPILFSDCSRPLSPVSSRSESPLSDKTGLGRFSPQFYGRQLPFTDSDGLYDFPSSECVKGGSCKSGSASHRKAGRRRDRKTTRTTSHEPTGTTKSTLPHMPHSMHNLLEVPYGNRGRKGGRRRSRSQAPALATSSSSEESVSNASVASVASARELRLPDLEMQYVCSQPEPVRTKKPLKRQKCRSSEDTSSKISSSLDLTEDSKKPNKISKLRSIGNQIRFLRRLEKSLKMKESYPAISDDEGDESSSVTSPLLQGRKDLNRMTGHAISAPLLGAARPKISRQRRYERSLLGEDTRTLSTAPGYDNSD-