Monarch geneset OGS2.0

DPOGS215296
TranscriptDPOGS215296-TA6393 bp
ProteinDPOGS215296-PA2130 aa
Genomic positionDPSCF300120 - 430417-462473
RNAseq coverage874x (Rank: top 15%)
Annotation
HeliconiusHMEL0086500.065.65% 
BombyxBGIBMGA007609-TA0.059.12% 
Drosophilacrb-PC0.043.88% 
EBI UniRef50UniRef50_P100400.043.04%Protein crumbs n=7 Tax=Coelomata RepID=CRB_DROME
NCBI RefSeqXP_310433.40.045.27%AGAP003873-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479708080.044.33%AGAP003873-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479708080.044.60%AGAP003873-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00055091.7e-13calcium ion binding
GO:00055153.5e-09protein binding
KEGG pathwayspu:5750273e-127 
 K02599 (NOTCH)maps-> Dorso-ventral axis formation
    Notch signaling pathway
InterPro domain[935-1182] IPR0133206.1e-27Concanavalin A-like lectin/glucanase, subgroup
[955-1166] IPR0089853.8e-23Concanavalin A-like lectin/glucanase
[1236-1388] IPR0017911.5e-17Laminin G domain
[1244-1387] IPR0126797.8e-17Laminin G, subdomain 1
[740-776] IPR0018811.7e-13EGF-like calcium-binding
[1428-1457] IPR0062093.5e-09EGF
[743-776] IPR0062101.4e-08Epidermal growth factor-like
[586-618] IPR0130911.8e-08EGF calcium-binding
Orthology groupMCL10375 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215296-TA
ATGAGCTGCGAGGTCCAGCGGGTCCCGGGGTGTGGACCACACTCATACAGCAGCGAGGGGGCGCTAGAGATGTTGAACACCTCACAGGGTTCGAAGAATGGGTGCGTGTTCCTGTCTCTGTTGGTGTGGTGTCAATGCTCGCAGGCGGCGGGTCCTCTGGGCCAGTCGGAGCGCCCGGAGGCCTACTTCAACGGCTCATCGTACATCCGACTCGCCACTCCGTTCTCACTAAAACAACTCGTGGGTCTCAGCTTCAGGACATGTGTCGGTGGTGAGCTGTTCTCTCAGCGGTTCGAGGGTTACACGCTGCATGTGACGGCCCTGCTGGAGCAGGTGGTCGTGTCGTGGGCGAGGCCCGGGCTGAGTCCGCGAGAGGTGGGGCTCGCGCGGGAGACGCTCGATAACCGCTGGCACTGGGTGGCGCTCCGCTACCTGCCCAGCCCGCCGGCCCTGCTGCTAGAAGTCGATAAAGATACACAGGTGATATCGAACGTGACATGGAACCCAGAGCTGTTGTCGATGGGAGCCCTGGAGGCGGGCGGCGCGGTGGTGCTGGTGGGGAACCTGTTCTCGGGCTGCGTGCACGAGGGTCCTCAGCTGGAGTTTCACGCAGCACACGTACTGAAAGCGAGTGCTCGCTTTACGAGTTGTCCCCTCACTACGGATGCATGCAAAGACGGGAAGGACGTTCTGCGGATACCCCCAAAAGATCACTGCTATAACGAACCGTGCCTCCGACACGGCACCTGCATCTCGAGACACGATAAGTACGAGTGTCACTGTACAGCGCGGTACACCGGCAACAACTGCGAGGTGGACAAAGGTGACCCGTGCGCCTCCAGCCCCTGCTGGCACGGAGCGCGCTGTGTGGAGGACGCCAGAGGAGACTACACGTGCCTGTGTCCGCCGCTATACCGCGGCGTTCACTGCGAGCTGGAGGAGTCCCTGGACCCGCAGTGCGCCGCCCAGCCGTGCCGCAACAACGGCAGCTGCAGCGTGCCGCCCGGGAGCGACGAGTACGTGTGCGAATGTGCTCCAGGTTACAGCGGACGAAACTGTGAGACCGACGTGGACGACTGTGCGGAGGCCGGCCAGGACGCCTGTCTGAACGGCGGACGGTGTGTGGACGAGGTCGACAACTACACCTGCGACTGCAGCGGAACCGGATACACGGGCCCGCGCTGCGAGGCCAACGTCAACGAGTGTGAGGAGGAGCGCGGCGTGTGCGGACACGGCGCCTGCTACGACACGTACGGCGGGTTCGTGTGCGCCTGTCTGCCGGGCTTCACGGGCGAGCGCTGCCACAAGATGTCGGCCTGCGCCTCCGGCCCGTGCGGCGCGGGAGGGGCCTGCGTCGAGGAGAACGGCGGCGCGGGCTTCCGCTGCGTGTGCGCGAGGGGCCTTTCCCGTATGGTGTGCGCCTGCGACCTGGGCTTCTACGGGCCGGCGGGCGCTGCGCCCGACTGCTCGTCGCTGGAGACGGCGTGCGAGGCCGGCGTGTGTCTCAACGGGGCCACGTGCCTCCGCTCCCAGGACAGGCTGGAGTGCGCGAGCGGGTTCCGTTGTGAGTGCGAGGCCGGCTGGAGCGGGCCGCGGTGCGAGGTGGAGGCGGTCGCGGGGGAGGGAGCCCCGGGCGCGGCCGAGGAGGACCCCTGCTCCTCCACACCCTGCAACAACGGCACCTGCACCGTCAGAGACAGCGCCGCCTTCGACTGCACGTGTCCGCCGGGAATCACCGGAAAATTTTGTGAAAACGACATTGACGAGTGTGCACTGATGCCAGACATATGTAACAATGGGAAATGTTTCAATATGCCCGGCTCCTACCAGTGTTATTGCAAGCCAGGATACACGGGAATAAGCTGCGAGCAGGATATCGATGAATGCTTGTCTTTGCCTTGCAAGAACGGGGGGACGTGCCAAAACTTGGAGAATAATTATGAATGTACCTGTGTCGAGGGTTTCGAAGGCAAGGATTGTTCAATAAACATCAACGAGTGTGCGGCCAACCCCTGCGCCGCCGGCTCCACCTGCATCGACGGCGTGGCCAGCTACCGCTGCATCTGCCAGGAGGGACTCACCGGACCACACTGCGAGACGAACATTGACGACTGTGAATCTCAGCCTTGCTCGCACGGCGGACAGTGCATAGACGGTCTGAACGGCTACACATGCGAGTGCGCCGGCACCGGGTACGCGGGCGACGACTGCGAGTTCAACATCGACGAGTGCGCCTCCCTGCCGTGCCGCAACGGAGGCACCTGCATCGACGACGTCAACGACTACCACTGCGTGTGCCACCCGGGCTTCACCGACAAGAACTGCTCCACGGACCTGGACGAGTGCGAGAGCTCCCCCTGCCTGCACGGCGGCGTGTGTCTGCAGCGCTCGAACGTGTCCCTGTACCGCGCCCCGGACGCCCCGCCGGTCCGCGTGGGGCCGCAGCCGCACATGGTGCTGCCGGATGTGTTCTACCGGCCCTTCTCCCCCGAGACGGCTGGCGGCTACGAGTGCGTGTGCGTGGCGGGGACCTCGGGCGCCCGCTGCGAACACAACGTGGACGAGTGCGCCTCCTCGCCCTGTCGCAACGGCAAGTGCGTAGACGCCGTCGGCGGGTACGCTTGCCACTGCGCGCCCGGGTACGAGGGCGAGCATTGCGAGCTGGAAATCGACGAGTGCGCGCGCTACGCCCCCTGCGAGCACGGCCGCTGCCACGACCGACCGGCCTCCTACTACTGTTCGTGCGAGGCGGGCTGGGGAGGCCGGAACTGCTCCGTCGTGCTGACGGGCTGCCACGGCGCGCCCTGCCGCAATAACGGCACCTGTCTGCCCTGGCTGGTGCGTGAGGACGAGCACCGTTTCAACTGTTCGTGCGCCCCGGGCTACTATGGACCCGCTTGCGAGAAGATCACTACGATGTCTCTGGAAAAATTCAGTTATGCGGAAGTGAACACGTCGCGGGAGGAGGGCTACGACATATCGTTCCGGTTCAAGACCACCCTGGGCAGCGGACTCCTCGCCATGGGACGCGGTCTGACCTTCTTCTTCCTGGAACTGTCCGAGGGAAGACTCAACCTGCAGTCGAGTTTGTTGAATAAGTGGGAAGGGGTGTTCATAGGATCTAACCTGAACGATAGTAATTGGCAGAAGGTGTTCGTGACAGTGAACACTTCTCACCTAGTGCTGGCAGCCAACGAGGAGCAGACCATCTATCCCATCAACCAGAACGAGGCGACCAACGCGTCTTGGACCTCGTTCCCGTCAACTCGCCTGGGCACGGCCGGCTCCTCCTACGTGACTCTCCGCCACGGCCCTGCCAGACACTCCAGGAATTTCTTCGTGGGCTGCTTTCAGGACGTCGTCGTCAACGGACAATGGGTCCTCCCCGAAGAGTCCAATTCGGGTTCAGGCGAGGCCCGCACGGAGCCTCCGCCCGAGGAAGAAGCCGCAGGTGGCGGAGCGCGGGCTGTACTCCGCGGGGTGCTAGCGTCTTGTCCCCGGACACCTCAGTGCGCTCCCAACCCTTGTCGCTCAGGCGGAGTGTGCGAGGACCGCTGGACCGCTTTCCGCTGCACCTGCCCTCGTCCGCACCTCGGCGACACGTGCCAGTACAACTACACGGCGGCGACGTTCGGCCAGGAACTGGCGCGGCCACGCTCCGTGGTGACGGTGGCGGTGCGCGAGGCGGCGCGCCGGGCCGTGCACGCGGCACTAGACATATCGATGTTCATCCGCACCCGCAAGTCCACCGGCCAGATCTTCTACCTGGGCTCGCTGCCGCGCTACGGCCAGCCCGACGATACGCTTGTGGCCGCCTCGCTGAAGGCCGGCGAACTGCTGGTGCACCTGCGATTCAACCAGACGCCCGAGGACTACACGGTGGGCGGCACGCGCCTCGACAACGGACACCTGCACCTCATCCAGGTGGTCCGTAACTCGACGCTGGTGCAGGTGAAACTCAACGGCACCGAGTACTTCCGCAAATCCATCTCGGCCGCTAAGCAGCTCGATGCGCAGGTGCTGTACCTGGGCGGCCCCCCGCCCCCGCCACAGCCCCCGGCGGTCGCGGAGGGCAGCTCCCCACCCACTCCTCCGGTCACCGAGGAGCCCGACGACGACGATTACTTCAAAGGAGTGATCCAGGACGTGCAGGTCTCCAACGGCGTCAACGTGACCGTCGTGGAGTTCTTCCCCCTGCAGGGCGCCCGCGTGCCGCCGCCCTTCGGGGACGTGACGCTGGACCCGCTGGGCGTGTTGCCGGGAGAGGTCTCGGACGACGCGTGCGCTTCTCGGCCCTGTCTGCACGACGCCGAGTGCCGCACCACCTGGAACGACTACGCCTGCATCTGTCCCTCCGGTTACAAGGGGAAGCAGTGCGCGGAGGTGGAATTCTGTCAGCTGCAGGGCTGCCCGCTTAACTCCCACTGCCGCAATCTGGACCGCGGCTACGAATGCGTCTCCAATGCCACCTTCGACGGCGTCGACACGGTCCTCAGCTACCGGCTGCGGGAGCCGGGCGGCGCACACGCGCTGGACCCGCCCGCGCCGCCCGCCTCGCTCACCCTCACCTATCGCAGCAAGGCCGGCGGCACGCTGTTCCGCGCAGAGCGCGAGGACGGCGAGGACGCCGTGTTTAGCGTCGGACTCTACAACGGCCAGGTCGGCGTGCAATGGCGGCTGGGCGGACTGCCGGCGCAGCGCCGCATGCGACTCAGGCGCCCCGGCCTCGCCTGGGTCACGCTGCGACTCACGCTGGCAGACGGAAGGATCAACGGGTCTTTCGTGGACGCCGACGGCCACGAGGAGCTGGGCCTGTCCGACGACATCGACGTGGCGGCCTGGCAGCGTCTCGTCACGACCGGACTCATCACGCTGGGAGGCGTGCGACCCAGTCCGCGACCACCCCTCACCACCACCACCTCCACCACCACCATCCTACCGGCCATGGAGAATTCCACGGACCCGAATGTTATGGAGTACTCCGAGGATGACCTCATAGATGACAACTTGGCCGGAGAATATTTCAAAGGCTGCCTGGGAGCGGTTCACGTGGGCGGCTTGCTGCTGCCCTTCTTCACGGAGGAGCAGTTGTTCGTGGGGTCGGCGGCGACGTTGCTCGCCGCCCAGCCTCACTACGCACTGCTCGGGGGCGAACCCTGGGGAGCCTCCGAGGGCGCGGGCTGTGTGCTGTGTGTGGAGGCGCAGTGCCAGCGGGGTGCGCACTGCCAGGACGTGCGGAACTCGTACGCCTGCGCCTGTCCTCCCGGGTACGCCGGCGACTACTGCGAGCGAGACATCGACGAGTGCCTCCACCACGACTGTCGGAACGGCGCCACCTGCAAGGACGAAGTCGCCAAGTACAGCTGCCTCTGTCCCGCCGGCTACGACGGAGACCTGTGTGAAAACGACATCGACGAGTGCGCGTCCGGTCCGTGCGGTCACGGGGCGACCTGTGTGGACGAGCCGGGCGGCTTCCGCTGCGAGTGTCCGGCCGAGTGGCGCGGGGTCCGCTGCGACTCGCCCGCCGCCACCACCTGCCGCCACGCGCCCTGCGCCAACGACGCGCAGTGTCAGGACACGCCCGATCCGGTGACCGGGAACAACTACACGTGTTCGTGCGGCGCGGACTACGCGGGCGTCCACTGCGAGCTGGCGTTCTGCGAGGTGACGGCCTGCGTCCACGGCGCGTGCGCGTCGGAAACTCGACCCCCGACGTGCTCGTGCAGGGCGGGCTGGGGCGGACGGCTGTGCGACGAGGAGCGGGACGAGTGCGCGGACGCCGCCTGTCTGCATGGGGCGCGCTGTGTGCCCACCAGACCGCCCACGTGCGACTGCCCACCAGAGTGGCGCGGAGCTCGCTGTGAGCTGGACGTGGACGAGTGCAGCGAGCGACCCGTCAGCTGCGGCCCGGGGCTGTGCCGGAACCTGCCCGGGACATACACATGCGAGTGTCCGGAGGGCTTCTGTGGCGACGGATGCGTGCTGCCTGACCCTTGTGTGCGATGGGAGGCGGCGACGGAGCGGGGGGAGGAGGGGGACGGAGGCCCCGGGCCCTGTCTGCACGGAGGACGGTGCGAGCAGCGCTGTGGCTCAGAAGTGGACTACATGTGTCACTGTGTGGACGGCTGGGGCGGCAAGAACTGCTCTCGTAGGGTGGAGGCCAGCGAGGCGACCGGCGGGGTGTCCCGAGCGGCGCGGTGGGCGCTCGGCGCGGGCGGAGCGCTGCTGGCGCTGCTGCTGGCGGCCGGGGCGCTGGCGGCGCTCGCGGCACAGGCCCGCCGGAAGAGAGCCACCAGAGGAACATACTCGCCCTCCGGACAGGAGTACTGCAACCCGCGCGCCGAAATGATCACGCACGCGCTCAAGCCGCCGCCCGAGGAGAGGCTCATATAG

Protein sequence:

>DPOGS215296-PA
MSCEVQRVPGCGPHSYSSEGALEMLNTSQGSKNGCVFLSLLVWCQCSQAAGPLGQSERPEAYFNGSSYIRLATPFSLKQLVGLSFRTCVGGELFSQRFEGYTLHVTALLEQVVVSWARPGLSPREVGLARETLDNRWHWVALRYLPSPPALLLEVDKDTQVISNVTWNPELLSMGALEAGGAVVLVGNLFSGCVHEGPQLEFHAAHVLKASARFTSCPLTTDACKDGKDVLRIPPKDHCYNEPCLRHGTCISRHDKYECHCTARYTGNNCEVDKGDPCASSPCWHGARCVEDARGDYTCLCPPLYRGVHCELEESLDPQCAAQPCRNNGSCSVPPGSDEYVCECAPGYSGRNCETDVDDCAEAGQDACLNGGRCVDEVDNYTCDCSGTGYTGPRCEANVNECEEERGVCGHGACYDTYGGFVCACLPGFTGERCHKMSACASGPCGAGGACVEENGGAGFRCVCARGLSRMVCACDLGFYGPAGAAPDCSSLETACEAGVCLNGATCLRSQDRLECASGFRCECEAGWSGPRCEVEAVAGEGAPGAAEEDPCSSTPCNNGTCTVRDSAAFDCTCPPGITGKFCENDIDECALMPDICNNGKCFNMPGSYQCYCKPGYTGISCEQDIDECLSLPCKNGGTCQNLENNYECTCVEGFEGKDCSININECAANPCAAGSTCIDGVASYRCICQEGLTGPHCETNIDDCESQPCSHGGQCIDGLNGYTCECAGTGYAGDDCEFNIDECASLPCRNGGTCIDDVNDYHCVCHPGFTDKNCSTDLDECESSPCLHGGVCLQRSNVSLYRAPDAPPVRVGPQPHMVLPDVFYRPFSPETAGGYECVCVAGTSGARCEHNVDECASSPCRNGKCVDAVGGYACHCAPGYEGEHCELEIDECARYAPCEHGRCHDRPASYYCSCEAGWGGRNCSVVLTGCHGAPCRNNGTCLPWLVREDEHRFNCSCAPGYYGPACEKITTMSLEKFSYAEVNTSREEGYDISFRFKTTLGSGLLAMGRGLTFFFLELSEGRLNLQSSLLNKWEGVFIGSNLNDSNWQKVFVTVNTSHLVLAANEEQTIYPINQNEATNASWTSFPSTRLGTAGSSYVTLRHGPARHSRNFFVGCFQDVVVNGQWVLPEESNSGSGEARTEPPPEEEAAGGGARAVLRGVLASCPRTPQCAPNPCRSGGVCEDRWTAFRCTCPRPHLGDTCQYNYTAATFGQELARPRSVVTVAVREAARRAVHAALDISMFIRTRKSTGQIFYLGSLPRYGQPDDTLVAASLKAGELLVHLRFNQTPEDYTVGGTRLDNGHLHLIQVVRNSTLVQVKLNGTEYFRKSISAAKQLDAQVLYLGGPPPPPQPPAVAEGSSPPTPPVTEEPDDDDYFKGVIQDVQVSNGVNVTVVEFFPLQGARVPPPFGDVTLDPLGVLPGEVSDDACASRPCLHDAECRTTWNDYACICPSGYKGKQCAEVEFCQLQGCPLNSHCRNLDRGYECVSNATFDGVDTVLSYRLREPGGAHALDPPAPPASLTLTYRSKAGGTLFRAEREDGEDAVFSVGLYNGQVGVQWRLGGLPAQRRMRLRRPGLAWVTLRLTLADGRINGSFVDADGHEELGLSDDIDVAAWQRLVTTGLITLGGVRPSPRPPLTTTTSTTTILPAMENSTDPNVMEYSEDDLIDDNLAGEYFKGCLGAVHVGGLLLPFFTEEQLFVGSAATLLAAQPHYALLGGEPWGASEGAGCVLCVEAQCQRGAHCQDVRNSYACACPPGYAGDYCERDIDECLHHDCRNGATCKDEVAKYSCLCPAGYDGDLCENDIDECASGPCGHGATCVDEPGGFRCECPAEWRGVRCDSPAATTCRHAPCANDAQCQDTPDPVTGNNYTCSCGADYAGVHCELAFCEVTACVHGACASETRPPTCSCRAGWGGRLCDEERDECADAACLHGARCVPTRPPTCDCPPEWRGARCELDVDECSERPVSCGPGLCRNLPGTYTCECPEGFCGDGCVLPDPCVRWEAATERGEEGDGGPGPCLHGGRCEQRCGSEVDYMCHCVDGWGGKNCSRRVEASEATGGVSRAARWALGAGGALLALLLAAGALAALAAQARRKRATRGTYSPSGQEYCNPRAEMITHALKPPPEERLI-