Monarch geneset OGS2.0

DPOGS200722
Transcript	DPOGS200722-TA	2280 bp
Protein	DPOGS200722-PA	759 aa
Genomic position	DPSCF300030 - 179944-188987
RNAseq coverage	100x (Rank: top 61%)
Annotation
Heliconius	HMEL008968	0.0	96.91%
Bombyx	BGIBMGA001121-TA	0.0	70.36%
Drosophila	dp-PC	2e-24	32.86%
EBI UniRef50	UniRef50_E2AYI1	0.0	57.35%	Protein kinase C-binding protein NELL1 n=1 Tax=Camponotus floridanus RepID=E2AYI1_CAMFO
NCBI RefSeq	XP_002433124.1	0.0	56.98%	protein kinase C-binding protein NELL1 precursor, putative [Pediculus humanus corporis]
NCBI nr blastp	gi|383856390	0.0	57.96%	PREDICTED: protein kinase C-binding protein NELL1-like [Megachile rotundata]
NCBI nr blastx	gi|383856390	0.0	53.58%	PREDICTED: protein kinase C-binding protein NELL1-like [Megachile rotundata]
Group
Gene Ontology	GO:0005515	7.5e-16	protein binding
	GO:0007155	4.4e-14	cell adhesion
	GO:0005198	4.4e-14	structural molecule activity
	GO:0005509	4.2e-11	calcium ion binding
KEGG pathway 
InterPro domain	[2-147]	IPR008985	5.1e-34	Concanavalin A-like lectin/glucanase
	[214-215]	IPR013320	1e-18	Concanavalin A-like lectin/glucanase, subgroup
	[199-252]	IPR001007	7.5e-16	von Willebrand factor, type C
	[1-139]	IPR003129	4.4e-14	Laminin G, thrombospondin-type, N-terminal
	[477-524]	IPR001881	4.2e-11	EGF-like calcium-binding
	[358-402]	IPR013091	1.4e-09	EGF calcium-binding
	[24-135]	IPR012680	1.1e-07	Laminin G, subdomain 2
	[8-138]	IPR001791	4.8e-06	Laminin G domain
Orthology group	MCL14623	Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200722-TA
ATGGAGCTTCTGCGACGATCGCCAGAGTTCACAGTCCTAGCCGCTCTTCGACAGGAGCCGGCTAACTCGGGAACAATTCTATCCTTCTCACACGGATACAACAGGTATCTGGAGTTGCAGTCAAGTGGTCGTCGTGATGAGGTGCGTCTTCATTACGTGGAGGCAGGTGGCGTAACGGCCCGAGTGGAGACCTTCCCGTTTCGACTCGCGGACGGCGCATGGCATCGGGTGGCACTTGCTGTCTCAGGGGCGCAAGCAACTCTGCTCGTTGACTGTCACCCGCTATATCGACGATTAATACCACCACCAGACCGGAATTTTACACAACCACAACTCTCATTGTGGGTAGGACAGAGAAATAGCAAGCATTCTTTATTTAAGGGAACCCTTCAAGATGTTAGATTGGTGAGTGGGCCTCACGGCTATTTGGTACAGTGTCCGGGACTGGACTCTGAATGCCCTACTTGCGGGCAGTTCTCACTGCTACAAGCCACTGTACAGGAACTAACTTCACATATCCATGACCTTTCACTAAAGCTTGTTGGCGCCGAAGCAAGACTGGCGCGTTTAGAACAATGTGATTGCCAAAAATCGTGTTACTCTAATGGGACAGTGCACGCAGATGGTGCAACTTGGCAAAAAGACTGCAATCGCTGCTCTTGCGTGCATGGTGAAATAACGTGCAGGCCAGTAGAATGCGACAGAGCGGAATGTAAAAATCCAGTGTTACATCCAGGAGAATGCTGTCCCACGTGTCTGAGACAGTGCCTCCTAAAGGGCACGCTTTATGAACACGGCGAGCGATTCGCTCCCAAAGAGTGCGCGGAGTGTGTTTGTCACGACGGTAATATGCAATGCGCACGCGTCGATCCCGACACAGCCTGCCCACCGCTACCCTGCGACGCCCCGGACCAGTTTACTGTACCCGGGGAGTGTTGCAAGTTCTGCCCTGGTGTGGACTACTGTAGTATGGGACATTCGTGCGATGAAAATGCTACTTGTATGAATCTTAATACAAAGTACACTTGTAAATGCAATCAAGGATTCCAAGGGGATGGAATCACATGTGAAGATGTAGATGAATGTCAAGCGGCTGGTGGTCTTTACGGTCACCACTGTCATTCCAACACTCGTTGTGTGAACGTCGTAGGAGGGTACGTGTGTCAATGTCTTCCGGGATACACCAGGAGGGATAAATTCAACTGTGTTGAGGTGGACGAGTGTTTGAGTGACACACATGGATGCGATCCTCACGCCGAGTGCAGTAACACGCCTGGTTCATACACCTGTCTGTGTAGGGAGGGATACTCCGGAGACGGTTATACATGTACACCTATATGTAGCGGAGGTTGTCTAAACGGCGGTGTATGTGCCAGTCCGGAGCACTGCGCGTGCGCACGCGGTTTCGCTGGCGCTCGTTGCGAGCGGGACGTTGACGAATGCTTGCGTGCGGCTCACCCCGCAGCGCCGAGAGCTTGCGTGCCGCGAGCCGCGTGCGTCAACACCCCTGGCTCATACTACTGCGTCTGCAAGAACGGCTATAGAAGAGACCCCCATAGAGATCACTGCGAAGATGTTGATGAATGTGCTGAAGGCTTTCATACCTGTCATCCAAGCGCACGGTGTGTTAACACGGACGGAGGATTCAGATGTGAATGTGATACAGAAAATTGTGAATTGAGTTGTTCATGGCAAGGCCGCATCGTGTCTGACGGCGGGCGGTGGTCGGAAGGCGGTGGATGTCGGGCATGTTCGTGTGCCAGTGGGGTGGCTACCTGCGAGGATGCTGTGTGCGCCTGTGACACGGACAACACGTCGCTAACTTTCCAGAGCTCGGAGTCTATTCCCCTGGCTCCGTCGTCCTGCTGTCCTCACTGCGATTCTCGGTATCACTGCCGTCACCAGGAGATGCACCACGTAACCTTCCGTAGCGGCGAACGCTGGCTCTACCAGTGCCAGATTTGTGAATGCCTCCTGGGTGAGGTGGACTGCTGGGAGCC
CGAGTGCGAGGATGGTGGAGGGTGCTGCGCCTTTGACACGGGGGAAGCCCCGGGGAGCAAGACCCGGGGCGAAGGGGAGACCTGGCGCACGCCCCACCGCCTCGAGCTCGCGGGCTGCGCGCCACCACACTGCCCCACGTGCCAGGGCGGGCAGTGTGCTACTTTGAGCTCGGCGCGACGCGGCGGCGGCGGCAGCCCTGGCCCTGGCGGCGCTGTGGTGGGGCCGCGCCCCACGACCTCACCGCAAGCACCGCGGCGCGCGGCGCTAGAGCCGCCCTGA

Protein sequence:

>DPOGS200722-PA
MELLRRSPEFTVLAALRQEPANSGTILSFSHGYNRYLELQSSGRRDEVRLHYVEAGGVTARVETFPFRLADGAWHRVALAVSGAQATLLVDCHPLYRRLIPPPDRNFTQPQLSLWVGQRNSKHSLFKGTLQDVRLVSGPHGYLVQCPGLDSECPTCGQFSLLQATVQELTSHIHDLSLKLVGAEARLARLEQCDCQKSCYSNGTVHADGATWQKDCNRCSCVHGEITCRPVECDRAECKNPVLHPGECCPTCLRQCLLKGTLYEHGERFAPKECAECVCHDGNMQCARVDPDTACPPLPCDAPDQFTVPGECCKFCPGVDYCSMGHSCDENATCMNLNTKYTCKCNQGFQGDGITCEDVDECQAAGGLYGHHCHSNTRCVNVVGGYVCQCLPGYTRRDKFNCVEVDECLSDTHGCDPHAECSNTPGSYTCLCREGYSGDGYTCTPICSGGCLNGGVCASPEHCACARGFAGARCERDVDECLRAAHPAAPRACVPRAACVNTPGSYYCVCKNGYRRDPHRDHCEDVDECAEGFHTCHPSARCVNTDGGFRCECDTENCELSCSWQGRIVSDGGRWSEGGGCRACSCASGVATCEDAVCACDTDNTSLTFQSSESIPLAPSSCCPHCDSRYHCRHQEMHHVTFRSGERWLYQCQICECLLGEVDCWEPECEDGGGCCAFDTGEAPGSKTRGEGETWRTPHRLELAGCAPPHCPTCQGGQCATLSSARRGGGGSPGPGGAVVGPRPTTSPQAPRRAALEPP-
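
The transcript and protein lengths listed above can be cross-checked against each other. The sketch below assumes the 2280 bp transcript is entirely coding sequence (no UTRs) and that the trailing "-" in the protein record marks the stop codon. The function name cds_is_consistent is hypothetical and is not part of any database tooling.

```python
# Standard stop codons in the universal genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_is_consistent(transcript_bp, protein_aa, first_codon, last_codon):
    """Check that a coding sequence length matches its protein length.

    Assumes the transcript is pure CDS: one codon per residue plus
    one terminal stop codon (an assumption, not stated in the record).
    """
    whole_codons = transcript_bp % 3 == 0
    length_ok = transcript_bp // 3 == protein_aa + 1
    starts_with_met = first_codon == "ATG"
    ends_with_stop = last_codon in STOP_CODONS
    return whole_codons and length_ok and starts_with_met and ends_with_stop


# Values taken from this record: 2280 bp, 759 aa,
# first codon ATG and last codon TGA of DPOGS200722-TA.
result = cds_is_consistent(2280, 759, "ATG", "TGA")
print(result)
```

With the figures from this record, 2280 / 3 gives 760 codons, which is 759 residues plus one stop codon.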