Monarch geneset OGS2.0

DPOGS203934
TranscriptDPOGS203934-TA4908 bp
ProteinDPOGS203934-PA1635 aa
Genomic positionDPSCF300005 - 355487-366809
RNAseq coverage447x (Rank: top 27%)
Annotation
HeliconiusHMEL0135190.089.02% 
BombyxBGIBMGA002114-TA0.084.66% 
DrosophilaLanB2-PA0.049.75% 
EBI UniRef50UniRef50_B4MMT40.049.48%GK16620 n=8 Tax=Pancrustacea RepID=B4MMT4_DROWI
NCBI RefSeqXP_002430825.10.052.06%laminin A chain, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420207720.052.06%laminin A chain, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420207720.052.79%laminin A chain, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00310122.7e-27extracellular matrix
GO:00071552.7e-27cell adhesion
KEGG pathwayaag:AaeL_AAEL0051870.0 
 K05635 (LAMC1)maps-> Small cell lung cancer
    Pathways in cancer
    Amoebiasis
    Prion diseases
    Focal adhesion
    ECM-receptor interaction
InterPro domain[39-276] IPR0082111.2e-106Laminin, N-terminal
[556-689] IPR0000342.7e-27Laminin B type IV
[551-676] IPR0180318.7e-26Laminin B, subgroup
[938-986] IPR0020493.6e-12EGF-like, laminin
Orthology groupMCL10759 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203934-TA
ATGGCGCGCCCATTCATTTACACATACTTTCTAACGTTTATGGCGATATCGTCAGCACAGGATAACTTCCATACACATAGGACATCACAAAAACTAGCTTCCTGTTATAGAAATGATGGACAACCCCAAAGATGTATACCGGAATTTGAGAATGCCGCTTTCATGGTACAAATGGAGACGACGAACACATGTGGAGACAACGGAGGAAAGATGTATTGCATACAAACCTCGGCAGGAACATCTATGCGTTCCTGTGACTTCTGTCAGCCGGGACAATTTTCAAGTTATTATTTAACTGATCTCCATTACGAACAAGACAATCAGACGTGGTGGCAGTCAGAAACAATGAAGGAAGGAATACAGTATCCAAATCAAGTCAATTTAACGTTACATTTAGGGAAGGCATATGATATCACCTATGTACGTATCGTTTTCTACTCTCCAAGGCCCCAAAGTTTTGCAATTTATAAAAAGGCCAGCGAAGATTCAGAATGGGAACCCTATCAGTATTTTAGTGCTTCATGTCGTGATACATATGGTGTATTGGAACAGCGCGCTGCGGAAATAGGAGCCGAGACGAAAGCACTTTGCACTAGCGAGTATTCAGATATTTCGCCATTGTCTGGAGGAAACGTTTTATTTTCTACTCTCGAAGGCAGACCTTCGGCTTATACTTTTGATAGCAGTCCTGAGTTACAAGAATGGGTTACTGCTACTGACTTGCGTATTTCCCTTGATCGCCTTAACACCTTCGGTGACGAGATATTCGGCGACGTACAAGTATTGCAATCTTATTGGTACGCCATTGCTGATGTAGCTGTCGGCGCTCGATGCAAGTGTAATGGCCATGCTTCTGTTTGTGAAAACCAGGAAATGCCTGATGGTTCTCGAGTTAGATATTGCAGATGCGAGCACAATACTGCAGGCAAGGAATGTGAAAGATGTTTAGATTTTTACAACGATGCTCCATGGGGTCGCGCTTCACCAACTAATGTACATGAATGCAAGGCATGTAATTGCAACGGGTTCTCCAATAAGTGTTACTTTGATAAGGATCTTTATGAGAATACGGGGCATGGAGGACATTGTATGGACTGTTCTGAAAATCGAGACGGTCCTAACTGCGAACGCTGTAAGGAGAATTATTTCCAGAGTATGCAGGACATATGCATGCCTTGTAATTGCAACCCTACAGGTTCGAGAAGTCTACAGTGTAATGCCGAAGGAAAATGTCAATGTAAACCAGGTGTGACAGGTGATAAGTGCGACATTTGTGCACCTAATCATTATGAATTTACAAACCAGGGATGCAAGCCCTGTGGCTGTAATGAATCCGGATCTTTTGATAACACTCCACAGTGCGATCCTATTACTGGACGCTGTTTCTGCAAACAAAATGTCGAAGGAAAGCAATGTAGAGAGTGCAAGCCCGGTTTCTTTAATTTAGACTTGGAAAATGAATTTGGATGTACGCCATGCTTCTGTTTTGGTCATTCTTCTCAATGTACTTCAGCGCCCAAGTACCAAGCCCACGAGCTGAGTGCACATTTCATAAGAGATGCGGAAAAATGGAATGCTGAAGACAGCAATCACAAACCAGCTACTCTTCAATTTAATGCGAATACCCAAAATATTGCTGTTTCTTCAAAGGATACCGAAGTTGTGTACTTCCTCGCTTCAAACCAATTCCTTGGAGATCAAAGGCAGTCTTATAATCACGACTTGAAATTTAACCTACGTCTGGGTGAAAAACGGGGCTATCCTTCCTCTCAAGATATTATTCTTGAAGGCTCTCGCACTTCAATATCGATGAATATATACGGTCAAAATAATCCCGAGCCTACTGATCAGGGTCAAGAATACGCATTTAGACTTCACGAGGATCCTCGGTATGGGTGGACCCCCACACTTTCAAACTTTGAGTTTATATCTATTCTTCAAAACTTAACGGCTATTAAAATAAGAGGAACTTACAACAAAGGTGGACAGGGATACTTAATGAATTTCAAATTGGATACGGCTAAGATCGGTAGAGAAAAAGGATCTGCACCGGCAAACTGGGTTGAAAAGTGTTCTTGTCCAAAAGCGTACGTTGGTGATTATTGCGAAGAATGTGCACCAGGTTTTAAGCATGAACCAGCAAATGGAGGTCCTTACTCGACTTGTATTCCTTGCGATTGTAATGGTCATGCACACATATGTGATACAGCTACTGGCTTTTGTATTTGTAAGCATAACACCACTGGCAGCAATTGTGAGTTGTGTGCAAAAGGCTTCTATGGAAATGCTATAGCTGGAACACCTGACGACTGCAAGCCGTGTCCCTGTCCTAAAGACAGCGGTTGTATACAACTCATGGATCAAAGTATTGTTTGTACAGACTGTCCTTCTGGATATGCCGGCCCAAGATGTGAGGTATGCGCAGACGGCCATTTTGGTGATCCTACTGGTCAGTTTGGAAATTCACAAGAGTGTGTAGAGTGCCAATGTAATGGAAACGTCGATCCTAACGCAGTTGGAAATTGTAATAGAACTACTGGAGAATGTCTTAAATGTATTTACAATACAGCAGGAGAACATTGCGATAAATGTTTAAGCGGCTATTTTGGCGATGCCTTGGATCAAAAAAAGAAGGGCGATTGCAAGCCTTGTCAGTGCCATGAGGCTGGTACTTTAGTGAGTGCAGAGGGACCTCCTCAATGTGACGGTCTTACAGGGTTTTGTTCATGTATGCCTCATGTTATCGGAAAGAATTGCGACCAATGTGAGGATGGATACTTTGACATCAGTTCTGGCGAGGGATGCCGAGCTTGCGATTGTAATTTAGAAGGCTCCTACAATGGTACCTGTAATTCCGTCACCGGACAATGTTACTGTAAACCTGGAATAGACGGGATTCACTGCGACCGTTGCCTCGCTTATCACTATGGGTTTTCGAGTGATGGCTGTAATAATTGTGATTGCGATGAGTGGGGTTCCACTAACTATCAGTGTGACATGCTCGGACAGTGTTCTTGCCAACAAAACGTCGAAGGGCGGCGATGTGATCGGTGTATGGAAAATAAGAGACCACGCACTGATGGACAGGGCTGTGAAGATTGTCCGCCATGTTACAACCTCGTTCAAGATGCAGTTAATCAGCACAGAAAGGAGCTTAAGGAGTTAGATAATATACTTGGAAAAATATCGAAAGCACCGACAGTTATTGAAAATGCGGACTTTGATAATGAGTTACAACGTGTGAGAGCTGATATAGATAGACTCGTTCAAGAAGCAGAAGCAGAACTTGGTAATGGGCCTAGTTCAAGTTTAACTAACAATTTGGCTGACCTTTCTGATCGACTCGCTGATGTCAGAAACATGTTGTTCAAGATCGAGGATGAGAGCTATGAAGGTAATGAATCTATAGAAAGAAGTAAAGGCAACGTGTCTAAAGCTGAAGAAACTATTGAAGCCGCACAAAAAGAAATTAATAGTGCATTAGAATATCTTGACGGTGAAGGAGCTGCTGCCTTAGCTAAAGCTCGCAATAGATCTGATCAATTTGGAAAACAATCAGTAGATATGTCTGCTTTGGCTAAAGAGTCGCGACTTTTGGCAGAAAAATTAGAAAAGGATGCCAGAAATATTAGAGACATAGCTGAAAAAGCATTAAACACATCTGTGGCAGCTCATATAATTGCTAAGGATGGTATAAAGAAACAAGCAAATATAAGCAATGAAGTGCAAATTTTAGCCACTGAACTAAATGCAGCATCTGGTAAACTAAGTAGTATGTCTGAGCTTGCTGGACAGGCTTTAAAAAGAGCAAAGACTGTCTACGATGAGGCATTGGGTCTCTATGCTGAAGTCAATACCACATTATTGCCAGACATTAAACTCAGCAAATTACGTCAAGACTCTATGGAAATGAATAGAACAATCGATGAAAAATCTGCAGAGTTAGAACAGTTAATATCAGTAACTGAAGATACTCTTCAGGCTTTGGATGATGAAATTCGACGAGGCAAGGACCTCTTAGAACAGGGTCACGACAGACAAGATGAACTTTATGATTTACTGGCAAAACTGGACCAGTTGCGTGCTCAGGCACAAAATGATGTCGAACTTACTAAAGCTACATTAAAAGATGCCAATGAAATATATAAGACCTTAAAGGAATTTAGTGATCAAGTGACGGAATCGCGACAGCAGGCAGAACAAGCGGCACTAGATGTTCCCGGAGTTCAGGAGAAAGTAGCTCTGGCTGAAGAAAGCATCACTAGTATTAGTGAAGAGCTCACTACTGCAAGTGATAAGGCCAAAGAAGCTCGGGATTTAGCTCAGAAGGCACAAAAAGAATATGCTGATAAGGCTTCTGAGGCAGCACACGAAATACGAAAAAAAGCCTCATCATATCGAGTGGAAGCTGGAAAGCTTCGGGATGAAGCTGATAAATTATCAACTCGTGTAAAGGGGACGGCTAAGCAGATACAAGTACTTGAAAAACAAGCTGATGAAAATATGCAGCTTACTAGGGATGCTAAAATGAAGGTGGGCCAAGCCAACACTGACGCTCGGGAGGCTGAAAAACAAGTCTCAAAAGGTTTGGAAGACTTGAAAGTCATTATGGACGAGTTGCAGAATCTACCAACTCTAGACGATGCTGCGCTCGACAGATTACAGGAAAGTCTCGACAAATCTGAAGCTGCCTTATTGGAAGTTGATTTGGACGGCAAAATCAAATCCTTAACCGAGGCTAAGAATAACCACCAAAGGTGGATGAAGCAATATCAGGAGGAACATGATGAGCTTCGCAGTGAAGTGGACAATATCAAGGATATCTTGGATCAGCTACCGGACGGTTGCTACAAACGGATCGTACTCGAGCCGACTGAAGGTCCAAGTAGACCCGCAAGCTTTAGATAG

Protein sequence:

>DPOGS203934-PA
MARPFIYTYFLTFMAISSAQDNFHTHRTSQKLASCYRNDGQPQRCIPEFENAAFMVQMETTNTCGDNGGKMYCIQTSAGTSMRSCDFCQPGQFSSYYLTDLHYEQDNQTWWQSETMKEGIQYPNQVNLTLHLGKAYDITYVRIVFYSPRPQSFAIYKKASEDSEWEPYQYFSASCRDTYGVLEQRAAEIGAETKALCTSEYSDISPLSGGNVLFSTLEGRPSAYTFDSSPELQEWVTATDLRISLDRLNTFGDEIFGDVQVLQSYWYAIADVAVGARCKCNGHASVCENQEMPDGSRVRYCRCEHNTAGKECERCLDFYNDAPWGRASPTNVHECKACNCNGFSNKCYFDKDLYENTGHGGHCMDCSENRDGPNCERCKENYFQSMQDICMPCNCNPTGSRSLQCNAEGKCQCKPGVTGDKCDICAPNHYEFTNQGCKPCGCNESGSFDNTPQCDPITGRCFCKQNVEGKQCRECKPGFFNLDLENEFGCTPCFCFGHSSQCTSAPKYQAHELSAHFIRDAEKWNAEDSNHKPATLQFNANTQNIAVSSKDTEVVYFLASNQFLGDQRQSYNHDLKFNLRLGEKRGYPSSQDIILEGSRTSISMNIYGQNNPEPTDQGQEYAFRLHEDPRYGWTPTLSNFEFISILQNLTAIKIRGTYNKGGQGYLMNFKLDTAKIGREKGSAPANWVEKCSCPKAYVGDYCEECAPGFKHEPANGGPYSTCIPCDCNGHAHICDTATGFCICKHNTTGSNCELCAKGFYGNAIAGTPDDCKPCPCPKDSGCIQLMDQSIVCTDCPSGYAGPRCEVCADGHFGDPTGQFGNSQECVECQCNGNVDPNAVGNCNRTTGECLKCIYNTAGEHCDKCLSGYFGDALDQKKKGDCKPCQCHEAGTLVSAEGPPQCDGLTGFCSCMPHVIGKNCDQCEDGYFDISSGEGCRACDCNLEGSYNGTCNSVTGQCYCKPGIDGIHCDRCLAYHYGFSSDGCNNCDCDEWGSTNYQCDMLGQCSCQQNVEGRRCDRCMENKRPRTDGQGCEDCPPCYNLVQDAVNQHRKELKELDNILGKISKAPTVIENADFDNELQRVRADIDRLVQEAEAELGNGPSSSLTNNLADLSDRLADVRNMLFKIEDESYEGNESIERSKGNVSKAEETIEAAQKEINSALEYLDGEGAAALAKARNRSDQFGKQSVDMSALAKESRLLAEKLEKDARNIRDIAEKALNTSVAAHIIAKDGIKKQANISNEVQILATELNAASGKLSSMSELAGQALKRAKTVYDEALGLYAEVNTTLLPDIKLSKLRQDSMEMNRTIDEKSAELEQLISVTEDTLQALDDEIRRGKDLLEQGHDRQDELYDLLAKLDQLRAQAQNDVELTKATLKDANEIYKTLKEFSDQVTESRQQAEQAALDVPGVQEKVALAEESITSISEELTTASDKAKEARDLAQKAQKEYADKASEAAHEIRKKASSYRVEAGKLRDEADKLSTRVKGTAKQIQVLEKQADENMQLTRDAKMKVGQANTDAREAEKQVSKGLEDLKVIMDELQNLPTLDDAALDRLQESLDKSEAALLEVDLDGKIKSLTEAKNNHQRWMKQYQEEHDELRSEVDNIKDILDQLPDGCYKRIVLEPTEGPSRPASFR-