Monarch geneset OGS2.0

DPOGS214264
Transcript: DPOGS214264-TA | 4785 bp
Protein: DPOGS214264-PA | 1594 aa
Genomic position: DPSCF300014 | + | 1586228-1598746
RNAseq coverage: 30x (Rank: top 76%)
Annotation
Heliconius: HMEL006399 | 6e-56 | 46.24%
Bombyx: BGIBMGA005982-TA | 2e-142 | 47.78%
Drosophila: eys-PC | 4e-166 | 34.20%
EBI UniRef50: UniRef50_D6WKS5 | 0.0 | 40.04% | Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6WKS5_TRICA
NCBI RefSeq: XP_001658690.1 | 0.0 | 39.50% | crumbs [Aedes aegypti]
NCBI nr blastp: gi|270006587 | 0.0 | 40.04% | hypothetical protein TcasGA2_TC010461 [Tribolium castaneum]
NCBI nr blastx: gi|270006587 | 0.0 | 40.95% | hypothetical protein TcasGA2_TC010461 [Tribolium castaneum]
Group
Gene Ontology: GO:0005509 | 1.8e-10 | calcium ion binding
Gene Ontology: GO:0005515 | 3.3e-07 | protein binding
KEGG pathway: bta:513730 | 3e-57
 K02599 (NOTCH) maps-> Dorso-ventral axis formation
    Notch signaling pathway
InterPro domain: [567-782] | IPR013320 | 2.8e-26 | Concanavalin A-like lectin/glucanase, subgroup
InterPro domain: [825-1047] | IPR008985 | 1.1e-24 | Concanavalin A-like lectin/glucanase
InterPro domain: [896-1017] | IPR012680 | 1.2e-11 | Laminin G, subdomain 2
InterPro domain: [63-99] | IPR001881 | 1.8e-10 | EGF-like calcium-binding
InterPro domain: [879-1020] | IPR001791 | 1e-09 | Laminin G domain
InterPro domain: [66-99] | IPR006210 | 3.3e-07 | Epidermal growth factor-like
InterPro domain: [67-97] | IPR006209 | 1.4e-06 | EGF
Orthology group: MCL14729 | Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214264-TA
ATGTTTAAACTACTTAACAGAGGACATTTAAAGAGTACGTGGTGGTTACTAATTGTAATACCTATCGCATCCGCCGGCTTTGCCTGCCTTAATAATCCTTGCGTACATGGGATATGTATTGATGATATTAATAGCACATATTTATGTTATTGTATTGATGGATATACGGGTGTTCAATGCCAAACAAATTGGGACGAATGCTGGTCAAACCCTTGCCAAAACGGTGGCACTTGCATAGATGGTGTGGCATCTTACAACTGTTCTTGCCCGGATGGCTTCATTGGCGATAATTGTGAGACAAATTACAACGAATGCGATTCAAACCCCTGTTACAACAATGGCACATGCATTGACATGACTAACGAGTACGTATGTCACTGCATCCCCGGCTTCTCTGGAGATCACTGTGAGTTAGATGTAGCAGTGTGCAACTCGACGGGGGAGGTCAGGTGTCACAATGGAGGCGAGTGCATCGAGGGTCCCGGGTTCAAATTTTATTGCAAATGTGCCGCAGGATGGACTGGACACAAATGTGAAGACCAGATCGATGAATGCGAGTCGAATCCATGCAGGAATGGTGGCATATGCATCGACGCTCATGCTGATTACATGTGCGCGTGCACATACGGTTTCACTGGTAAAAGCTGCGAGGTAGCGATAGAGTTCTGCTCTCAGGATTCCTGCAGCGAGAAGGCGCTGTGTGTTCTGGAAGACGTGTTGCGAGTCTGCTACTGTGTGCCCGACTATCACGGGGAACGATGTGAACTACAGTACGACGAGTGCGCACTAGGACCAAGATGCCTGAACGGCGGGACGTGTATAGATGGCGTTGACAATTTCACATGCTCATGTCCACCAAGACTTACAGGACTTCTCTGCGAGTGTCTTATTTTGGATGACGGAAATTATGATTGTGAATATATTCGTCCAACTCTTCTACCAGACCATAGTACAGCCACTTCTTCCTTTACTGAAACCATAATAATAGACACAAGTACCATGGAATCGAAGTATAATTCAAGCACTACCACCACCAGCCTCAGTACTATCGATAGCGGCACCAGTATTGATGTCATTACTACGGACATGGCAATTTATACGAAACTAGATAATGTAACGGACATACCAATAACCGCTTCCAGTACAGACACAATAGTAACAGAAAATCTAACAACAAGCACAGAAATATCTGATATGACAACTGATTCATTGACTTCAACATCAATTTCAACATCTCGATCAACAACGAAAGAAGATTCGGTTACTGAAATAGTTACGATTCTGATAGAAACGAAAGGTACTATAGGAGCTGATGATTCTAAGACAGAAATAACAACAGAATGTAGTGGATCGTGTCCAAAAGGAAATTTTTCCACTAGTGATTTACCACCAACCACTATAACTTCAATAGAAACAACTGAAGGAATCACTACTTCCACAGAAATTACTAAACAAACAGCAACAGACACAACAGTTCAGACAACCGTTGATCTTAAAGAAACAACAAAGCAAATGACGTCAGATACCACAGAATATACTCATCAAGCACAAGACATGACAACAGAAAGAATGTTCACCGACAGTCCTGTTGAAACAACAGAACTTGCAACCGAATTAACCCATCCAATGACCGAAATCGAAACAAGTACAGGTTATAATCAAATTTCAACAGCACACTCAGATTGTACCGACGTCATCTGTAACAATCACGGCAGTTGTATAAACACTCTTCATGGCGTTAGATGTCACTGTTTGTTCAATTACGAAGGAAGATTTTGTGAGAGTAAAATTATTGTTAACTCGGCCGCCTTTGATGGCACTTCTTATATAGCGCATCATATAAAAAATTCTACCAGCATATCTATAGCATTCAAAGCCAAAACTCTAATCCTTGACGGGCAAGTTATGTACGTGGATATAGCTAAGGGCGCTTACATGAAATTGTATATGAATTCTGGCTTGTTGAGATTCGAATTCTCCTGTGGCTATCAGACAATGCT
GCTAAGCGAACTTAAAACTCACCTTAACAAAGGATATATTATGAAAATTGAAACAAGATTAGATATATTCTTACCGGAAAATCATTGCAACGGAACCCTGAGACTTAACGACACTGTGGCTATGAGCGGCGGCCAGTTTGCAAATATTAGCTCTCCCGAGTATAATTCGATTCTCTATTTCGGGAACATACCTAATGCTAATAGAAATAACTCTAATGAGAAGTCTTTTATTGGATGTATCAAGGACTTAATTATAAACGACGAGAGACGTGAAATATTTAGCGACGCTTACGAAGCGTCTGAGGTGAGGGAATGCTCTTCTTTGTCTTGTTTGTCGTCGCCGTGTGTGAACGGTGGTACTTGCAATGATGACGACGATACATACTCCTGTGCTTGTGCCAATGGTTGGACCGGCGCCACTTGCAACGACTCCGTCTGCGACCACAACCCTTGTCAGTCCGGTGGAAGTTGTGTCCATCACCCCGGGAGCGGATTCCTGTGCCTCTGTCCATATGGCAGGCACGGCATATTCTGTGAATATAACGTGGAAATAACACGTCCGTCTTTATCGCCTATATCCCCTGGAAGGTCTTCCTACGTCTTGTATCCGATGTCACAGTCCGCAGCGAATTCTGATCGGTTTGAAATGCGTTTGCGTTTTCAAACGTCGGACATGGATCAGATAGCGTTGCTCGCGTTCGTTGGACAAAGAGGAAGACACGATGCCAGGAGTCAACATTTAGCTTTGACCTTTGTGAAGGGTTACGTTATGCTGACGTGGAACATGGGCGCTGGACCCCGACGTATTTTCACGTCCCGTCCTCTGGGTCCACGGCGCGGGGGACACACGGTGCGGGTCTGGAGACGAGGAAGAACAGCCGGCCTCGTGGTCGACGGGCGATACAATGTATCAGGGAACGCACCCGCCCACACCAACAACATGACTTTACTACCATACATCTATATTGGCGGTCACCCATCCGATGACTTCCGCGACCTGCCCCATGACCTGCCCCTGCACAGCGGGTGGTCGGGGTGCGTGTTGGAGGTCACGGGTCAGTCAGGGGGAGGACGGGGGGTCGGCGGCCGGGGCGTGGGCCAGTGCGGGGTCACTCAGTGCACCGCCAAGTCCTGTAACGCACCCCGCGGCGTCTGTATACACTCCCCCGCCACTTACGGATGCATCTGTAACGAAGGCTGGTTCGGTGCGACCTGCGCCAGCCCTCGCAGTCCGTGCGATCGATCGCACTCTCGCTGCCAAGGTGCCTGTGTCATTACACTCACTGACGCACACTGTGACTGCCCTTACGGCAAGTCTGGACCTAACTGCGATCAAGAATTAATACCAATCGATGTTCTATTCACCGGCGCTAGATCCTATCTGAAGCTGAAAGCTAGATCTATTTCTAGTGTGAGCTTAGCTCTGGAAGCGGAAATTAAACCTCAAAAGGAGAGGGGATTGATTGTATTCGTCGAAACGCCGCATTTCTATACGTCGCTTTCGCTTCAGGGTGGTTTGTTGGAGTATAGATGGACGGATAATTTGTCCGGTCTGACGTCACTGGTCCGCTCGGGGGTGGTGGTGTCGGTGTCACAATGGCACGGCGTGAGGGCGGGTCGCTATGGCAACCGGCTGTATGTGTGGGTAGACGGCGCCCTCAGTGGTATGAGAGATCTGTCTCGGTTACCGTTGGATGTTATGTCCGGCCCTCCTGAATCCTACAGCGGCTGCTTTAGGAACTTCCATCTAAATAATATATTGTTACCTCTCGAACAACAAAATATAGAAGAGGGTCAAAACGTGCTAGCGTGTGAAGGGTCTAGTTGCGGCGCTCGTTGTAGACGAGCGGCATGTTCCCGCGACACGTGTGCGGGGAGGTGTCGGCGCGGACGCTGCGTGTGTCCGGCGGGACGGGCGGGTGTTACTTGCAGGGAACATATAAACATAACGATACCTCAATTCGGAGGGGACGCCATGTTGACACTCAGTCGGAGCGATC
GTCGAGAACAATTGATTGAAGCGTCACCCGCTCGAATAAAACTCAACTTCAACACCGCGGACCCGAACGGGCTCATAGTTTGGATCAATACGGGTATAGACTACTTCGGCGTTGGTCTCGAGAACGGATATATTAAACTTAGTTGGTCTGTACATTGTAACAATTCAAGTGGTCAAACTACGAGAGACTATTTTCCGTTACCACCAAAACTAACTCCGACTCTGGTCAGTGCGGGCTTCTTGGCGGACGGAGAGTGGCATTCGATTGCATTGACCCTAAGACATAACATCTCTTTGTCTATCGACGAAAAGTTATTCGTTGATCAAGAATGCATTCAAATTGAAGACGATGATGACACTGAGTTATTTATAGAATTAATACCAATCGATGTTCTATTCACCGGCGCTAGATCCTATCTGAAGCTGAAAGCTAGATCTATTTCTAGTGTGAGCTTAGCTCTGGAAGCGGAAATTAAACCTCAAAAGGAGAGGGGATTGATTGTATTCGTCGAAACGCCGCATTTCTATACGTCGCTTTCGCTTCAGGGTGGTTTATTGGAGTATAGATGGACGGATAATTTGTCCGGTCTGACGTCACTGGTCCGCTCGGGGGTGGTGGTGTCGGTGTCACAATGGCACGGCGTGAGGGCGGGTCGCTATGGCAACCGGCTGTATGTGTGGGTAGACGGCGCCCTCAGTACGGAACCCATGCTGGCGCACGCCTACCCGCATACAGCCAGCGAAGCATCCATCGTTATAGGCACGGATCACAATCAATCTATTTAA

Protein sequence:

>DPOGS214264-PA
MFKLLNRGHLKSTWWLLIVIPIASAGFACLNNPCVHGICIDDINSTYLCYCIDGYTGVQCQTNWDECWSNPCQNGGTCIDGVASYNCSCPDGFIGDNCETNYNECDSNPCYNNGTCIDMTNEYVCHCIPGFSGDHCELDVAVCNSTGEVRCHNGGECIEGPGFKFYCKCAAGWTGHKCEDQIDECESNPCRNGGICIDAHADYMCACTYGFTGKSCEVAIEFCSQDSCSEKALCVLEDVLRVCYCVPDYHGERCELQYDECALGPRCLNGGTCIDGVDNFTCSCPPRLTGLLCECLILDDGNYDCEYIRPTLLPDHSTATSSFTETIIIDTSTMESKYNSSTTTTSLSTIDSGTSIDVITTDMAIYTKLDNVTDIPITASSTDTIVTENLTTSTEISDMTTDSLTSTSISTSRSTTKEDSVTEIVTILIETKGTIGADDSKTEITTECSGSCPKGNFSTSDLPPTTITSIETTEGITTSTEITKQTATDTTVQTTVDLKETTKQMTSDTTEYTHQAQDMTTERMFTDSPVETTELATELTHPMTEIETSTGYNQISTAHSDCTDVICNNHGSCINTLHGVRCHCLFNYEGRFCESKIIVNSAAFDGTSYIAHHIKNSTSISIAFKAKTLILDGQVMYVDIAKGAYMKLYMNSGLLRFEFSCGYQTMLLSELKTHLNKGYIMKIETRLDIFLPENHCNGTLRLNDTVAMSGGQFANISSPEYNSILYFGNIPNANRNNSNEKSFIGCIKDLIINDERREIFSDAYEASEVRECSSLSCLSSPCVNGGTCNDDDDTYSCACANGWTGATCNDSVCDHNPCQSGGSCVHHPGSGFLCLCPYGRHGIFCEYNVEITRPSLSPISPGRSSYVLYPMSQSAANSDRFEMRLRFQTSDMDQIALLAFVGQRGRHDARSQHLALTFVKGYVMLTWNMGAGPRRIFTSRPLGPRRGGHTVRVWRRGRTAGLVVDGRYNVSGNAPAHTNNMTLLPYIYIGGHPSDDFRDLPHDLPLHSGWSGCVLEVTGQSGGGRGVGGRGVGQCGVTQCTAKSCNAPRGVCIHSPATYGCICNEGWFGATCASPRSPCDRSHSRCQGACVITLTDAHCDCPYGKSGPNCDQELIPIDVLFTGARSYLKLKARSISSVSLALEAEIKPQKERGLIVFVETPHFYTSLSLQGGLLEYRWTDNLSGLTSLVRSGVVVSVSQWHGVRAGRYGNRLYVWVDGALSGMRDLSRLPLDVMSGPPESYSGCFRNFHLNNILLPLEQQNIEEGQNVLACEGSSCGARCRRAACSRDTCAGRCRRGRCVCPAGRAGVTCREHINITIPQFGGDAMLTLSRSDRREQLIEASPARIKLNFNTADPNGLIVWINTGIDYFGVGLENGYIKLSWSVHCNNSSGQTTRDYFPLPPKLTPTLVSAGFLADGEWHSIALTLRHNISLSIDEKLFVDQECIQIEDDDDTELFIELIPIDVLFTGARSYLKLKARSISSVSLALEAEIKPQKERGLIVFVETPHFYTSLSLQGGLLEYRWTDNLSGLTSLVRSGVVVSVSQWHGVRAGRYGNRLYVWVDGALSTEPMLAHAYPHTASEASIVIGTDHNQSI-
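
Length check (illustrative sketch, not part of the original record): the record reports a 4785 bp transcript and a 1594 aa protein. These are consistent if the transcript is a complete coding sequence with one stop codon, since 4785 / 3 = 1595 codons = 1594 residues + 1 stop. The Python below sketches that check. The function names read_fasta and cds_matches_protein are hypothetical helpers, and reading the trailing "-" in the protein sequence as the stop marker is an assumption.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def cds_matches_protein(cds_len, protein_len):
    """True if a CDS of cds_len nt encodes protein_len residues plus one stop codon."""
    return cds_len % 3 == 0 and cds_len // 3 == protein_len + 1


# Lengths as reported in the record: 4785 bp transcript, 1594 aa protein.
print(cds_matches_protein(4785, 1594))  # True
```

To apply the same check to the sequences themselves, one could pass the two FASTA blocks through read_fasta and compare len() of the nucleotide record with len() of the protein record after stripping the assumed trailing "-".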