Monarch geneset OGS2.0

DPOGS204081
Transcript: DPOGS204081-TA  5430 bp
Protein: DPOGS204081-PA  1809 aa
Genomic position: DPSCF300200 + 259666-279080
RNAseq coverage: 197x (Rank: top 47%)
Annotation
Heliconius: HMEL013144  0.0  59.39%
Bombyx: BGIBMGA010816-TA  0.0  60.79%
Drosophila: dsd-PB  8e-153  38.81%
EBI UniRef50: UniRef50_Q7QH41  4e-168  51.69%  AGAP003506-PA n=13 Tax=Endopterygota RepID=Q7QH41_ANOGA
NCBI RefSeq: XP_309653.4  1e-168  51.69%  AGAP003506-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|347970017  1e-167  51.69%  AGAP003506-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|347970017  1e-176  51.37%  AGAP003506-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology: GO:0005515  1.7e-16  protein binding
    GO:0016020  2e-06  membrane
    GO:0007275  2e-06  multicellular organismal development
    GO:0004872  2e-06  receptor activity
KEGG pathway: oaa:100075055  4e-08
    K06243 (LAMB2) maps-> Small cell lung cancer
        Pathways in cancer
        Amoebiasis
        Focal adhesion
        ECM-receptor interaction
InterPro domain: [69-195]  IPR000859  8.8e-21  CUB
    [524-754]  IPR015915  1.7e-16  Kelch-type beta propeller
    [967-1012]  IPR016201  2e-06  Plexin-like fold
Orthology group: MCL10637  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204081-TA
ATGGTAGAATCATTGCAAATGTTTTTATTTCTTTTCAAATCAAAATACAGACGAAAATATTCGTGGTTCTCGCCGTTCTTGTGTTCAGTGCTCATAGTGTTATTGTTTTGTCATGGTGTACTATCGAAGTGTAGTGATCACAACTGTTTTAACGGTGTATGTAACAATGATACCTGCGTGTGCTACGAGGGCTGGCAGGGCTCCCAGTGCCAACACTGCGGCGGGAAGATTAAATTGACGGAGACGTCTGGTGTTATAACCGATGGTCCCGGTAATTATAGCGTTAGTACACAATGCTCGTGGTTGATCACACCGCCACGCGTGGGGCCCACGTTGCCCACTGTGCGGGTGACACTGGAGAGTTTTGCCACGGAGTGCGGATGGGATCATTTGTATGTATACGATGGTGATAGTGTCCGAGCTGAGAAACTATTGGCTGTGTTTAGCGGGGTTTTAGATAAGAACGAGTCTAACTGGACTCGCCAGGTTATAGCGCGGTCAGGTAGTGTTCTTTTGCATTTCTTCTCTGACGATGCTTACGCCATGGAAGGTTTTAATGTCACCTACGATGCCTACTCCTGCCCATCCAACGACCACAGGACCAACTGTTCCGATCACGGCGAGTGCGAGGAAGGTTCCTGTCGGTGTGACGATGACTGGCTCGGGGTAGCCTGTGACCAGCCTTTGTGTCCCAACGACTGTAACGCTATGTACGGAGCTGGGTCGTGTACGTCGTCTGGCTGCGTGTGCACGCCGTCCAAGACTGGAGCAGATTGCAGCCGGGACGCGTTTATATCCGGCTGGGGGTGGGCGTGGCGGGAGGAGGGGGAGGGGGGTGAACGCCCGCGGAACATGCCGCCGCCGACAGCTGGGCACGTGCTTGTCAACTATGGTGACGATATAATAATGGTGGGAGGGGAAATGTTCCAAGACGCAGCGTTTATGTACAGATATAAACCGAGCCTTAAGGAGTGGAAGGTAGTTGAGGCCCGGGGCAAGGCGCCACAGATGCGGTTCGCCCACACGGCCATAGTCCACGGCGAGGAGATCATAGTGTATGGCGGGGTGGTGGTCACCGACGAGCTGGAGAGGAGCGGGGGTCTCGCGGGGGTGGAGGGGCGGGCGGCGTTCGTCAGCAACGAGATCTGGACGGGTCGGCTGTCGGGGGGCTTCGTCCACTGGACCAACGACACGCCGCGGACGTGCTCTCCTCACCACCCCGCGCCGTTCGACCACTGCGGCGGGCTCCACCTGTCGGGCCACTCGTCAGTCCTGGTTCAAATCGGCCCAACCTCCAAGCCCGTGATGTTGGTGTTCTTCGGCCACTCCCCCCACTACGGCTACCTGCATCTCGTACAGGAAACATTGATATGGGAACTATATCTTGGAGACGCTCAGGCCAGCAGCGGGGGTCTCGCGGGGATGGAGGGGCGGGCGGCGTTCGTCAGCAACGAGATCTGGACGGGTCGGCTGTCGGGGGGCTTCGTCCACTGGACCAACGACACGCCGCGGACGTGCTCTCCTCACCACCCCGCGCCGTTCGACCACTGCGGCGGGCTCCACCTGTCGGGCCACTCGTCAGTCCTGGTCCAAATCGGCCCAACCTCCAAGCCCGTGATGTTGGTGTTCTTCGGCCACTCCCCCCACTACGGCTACCTGCATCTCGTACAGGAGTACTATATAGAGGAGAAGGCGTGGGGCGTCGCCCGGACCCGCGGCTGGCCGGCCAGGGGAGGGTTCGCTCACACCGCTGTATGGGACGCGCTCAGCGGCAGGGTGTACGTACACGCTGGACTCGTCTCCGAGTCGGAGGCGACACAGGCGCCGTCCGCCGCGCTGTACGAGTATGAAGTTGAAGCACGGATATGGCGCCCGCTGCCCTCCGCCCCCACGCCCAGATATCTACACACCGCCATATTTATATCGCCAGGGGTCATGTTGGTGTTCGGGGGGAACGCCCACAACGACAGTGCTGCCGCGGCGCTCACGGCCTCGGGCGC
GTCCCAGTGCTACGCGGCCAACGCGCTGCTGTACTACGCCAGGTGTCGCCAGTGGATGTCTGCGGGCGGCCTGCTGGGCTCGCCTCGCGCCGGACACGCGGCCGCTCTGCTTCCGGCCAAGAGACCCACCGTCATCATACATGGCGGCTTCGACGGCCGCCTTCGCTCGGACGCGCTTGTCTTCGAGTCCGGAATGCGCTGTTCGTGGTACAAGGACGAAACGTCCTGTATGAACAGCGCCAGGCACGGCGTCTCGTGTGTGTGGCGCCTTAGAGATATGCTGTGCGTCGGGATAAAGGAAGTAGGGTGGAAGGATTCTTTCACGGATGCTGTAAAAGCCTGCATCGACGAGCCAGTAGTCGTTCACTCAGCCTGTGATCTCTGCTCCCCAGATGAGTCTCGCTGTGCCGTGTCTTCGTGTGAAGCTTGTACAGCGCTTGGTTGTGCTTGGTGCGGCTCGTGTCTCCCGTCCGCGTATCACTGTCGACGATCCCGGACGGCACACGGACCGGTGACCCTGTCCGTGTCGGAGTGTCCGCCGAGCGGCGCGTCGTGTTCGCGCTACCACTCGTGCGCCGCGTGTCACGCGCATCTACACAGACACCCTCATGGCTCGGAAGACTTAAACCAACGGGCGTGTTACTGGGACTATGACACGGTGAAGTGCCGGCCGGCCAATGCGACCACGGATATAAGGGGCTCGCCGAGTGTGTCGGGGTCGTGCAGCGCCGCGTGCTCGTCCTATACCACATGCGGGAACTGCACCGCTGAAGAGTGCATCTGGTGCGCCTCCGCCGGGAGGTGCGTGGATAAGAACGCTTACGGAGCTTCGTTTCCGCTGGGCGGGTGTCGCGCGTGGTCCACCAGCGGCTGTGGAGGTGTGGGGGTGACGGGGGGTGTCCCGGGGGGCGGCTGCTCGTCGCACGTGTCGTGTCGCTCGTGTCTGTCGGAGCCCGCGTGCGGCTGGTGTGATGACGGCGCGGGCGGCGGGCGAGGAGCCTGTCTGCCGGGAGGTGACCGTCACCCCCACCATCCCCACATCTGTCCCAGGAGACGATGGCACTTCACGTCGTGTCCGTCGTGTCAGTGTAACGGCCACTCGGTGTGCGACGCGGCGTCCCGTTGTGTCCAGCCGTGCGGGTCCCGGGCCGTGGGCCCCCACTGTGACACTTGCGCCCCCGCGCACTGGGGTACCCCGCTCAACGGGGGGGTCTGCACGCCGTGTGAGTGTAACGCCCAGGCCGTGTCGTGCGCGGCGGACACGGGCCGCTGTTTCTGCAGCACCAAGGGCCTGGCGGGCGACAGGTGCGACAAGTGTGACAACACCAACCACTACCACGCCGACGTCTACAACAAGGGCTGCTACTACGACCTAGCCGTCGACTATCAGTTCACCTTCAACCTGTCCAAGAAGGAGGATCGTCATTTGTCCGCCATTAACTTCCGGAACGCTCCCGTCAAACCGGACGTGGACGCTGACTTCAGTATCACGTGTTCCGCCCACGCCAGGATGAATCTCACCGTCAGGACCAAATCTGATCCTGAGAGGACGTTATTCAGTGACGTCAATTGCACCAATTTTAGATACAAGGTCCGCCAGTTTGCCACTTTTCTCTCAATATCAATTATAATCCTAGTGGTCCGTGTCGGCCTGTATATTACAGTTCACACAGCCTGTGATCTCTGCTCCCCAGATGAGTCTCGCTGTGCCGTGTCTTCGTGTGAGGCTTGTACAGCGCTTGGTTGTGCTTGGTGCGGCTCGTGTCTCCCGTCCGCGTATCACTGTCGACGATCCCGGACGGCACACGGACCGGTGACCCTGTCCGTGTCGGAGTGTCCGCCGAGCGGCGCGTCGTGTTCGCGCTACCACTCGTGCGCCGCGTGTCACGCGCATCTTCACAGACACCCGCATGGCTCGGAAGACTTAAACCAACGGGCGTGTTACTGGGACTATGACACGGTGAAGTGCCGGCCGGCCAACGCGACCACGGATATAAGGG
GCTCGCCGAGCGTGTCAGGGTCGTGCAGCGCCGCGTGCTCGTCCTATACTACATGCGGGAACTGCACCGCTGAAGAGTGCATCTGGTGCGCCTCCGCCGGGAGGTGCGTGGATAAGAACGCTTACGGAGCTTCGTTTCCGCTGGGCGGGTGTCGCGCGTGGTCCACCAGCGGCTGTGGAGGTGTGGGGGTGACGGGGGGTGTCCCGGGGGGCGGCTGCTCGTCGCACGTGTCGTGTCGCTCGTGTCTGTCGGAGCCCGCGTGCGGCTGGTGTGATGACGGCGCGGGCGGCGGCGAGGAGCCTGTCTGCCGGGAGGTGACCGTCACCCCCACCATCCCCACATCTGTCCCAGGAGACGTAACCTCCAACCTCCGCGTGTGTGATGTCTGTCCTCTCCCCGCTAGATGGCACTTCACGTCGTGTCCGTCGTGTCAGTGTAACGGCCACTCGGTGTGCGACGCGGCGTCCCGTTGTGTCCAGCCGTGCGGGTCCCGGGCCGTGGGCCCCCACTGTGACACTTGCGCCCCCGCGCACTGGGGTACCCCGCTCAACGGAGGGGTCTGCACGCCGTGTGAGTGTAACGCCCAGGCCGTGTCGTGCGCGGCGGACACGGGCCGCTGTTTCTGCAGCACCAAGGGCCTGGCGGGGGACAGGTGCGACAAGTGTGACAACACCAACCACTACCACGCCGACGTCTACAACAAGGGCTGCTACTACGACCTAGCCGTCGACTATCAGTTCACCTTCAACCTGTCCAAGAAGGAGGATCGTCATTTGTCCGCCATTAACTTCCGGAACGCTCCCGTCAAACCGGACGTGGACGCTGACTTCAGTATCACATGTTCCGCCCACGCCAGGATGAATCTCACCGTCAGGACCAAATCTGATCCTGAGAGGACGTTATTCAGTGACGTCAATTGCACCAATTTTAGATACAAGTTCGCCAAGTCCGAGCACGCCTTCGGTGTGGAGGACAACGTGACGCTGACGACGTTTTTCGTGTACGTGTACGACTTCCGGCCGCCGCTCTGGATACAGATCTCCTTCTCTCAGTACCCGAAACTCAACTTGCAGCAGTTCTTCATCACGTTCTCGTCGTGCTTCTTGATGCTGCTGTTGGTCGCTGCGGCACTGTGGAAGATGAAACAGAAGTACGACCTGTACCGCCGCCGCCAGCGCCTGTTCGTTGAGATGGAACAAATGGCGTCCCGGCCCTTTAGCACAGTGAGCATAGAGCTGGAGCGGGGAGGGGGCGAGGGCGGAGTCCCGGCCCCTGTGGCGTTGGAGCCGTGCCGCTGGGGTCGGGCGGCCGTGCTGTCCCTGGTGGTGCGCCTGCCGCAGGGCGGGGCGGGTCGAGCGCCCCCTCAGGGCGGCCTCGCCCTCGCCTCGGCCCTCGTCACCCTCGGCCACGCTCACCACCACGACAGGTGA

Protein sequence:

>DPOGS204081-PA
MVESLQMFLFLFKSKYRRKYSWFSPFLCSVLIVLLFCHGVLSKCSDHNCFNGVCNNDTCVCYEGWQGSQCQHCGGKIKLTETSGVITDGPGNYSVSTQCSWLITPPRVGPTLPTVRVTLESFATECGWDHLYVYDGDSVRAEKLLAVFSGVLDKNESNWTRQVIARSGSVLLHFFSDDAYAMEGFNVTYDAYSCPSNDHRTNCSDHGECEEGSCRCDDDWLGVACDQPLCPNDCNAMYGAGSCTSSGCVCTPSKTGADCSRDAFISGWGWAWREEGEGGERPRNMPPPTAGHVLVNYGDDIIMVGGEMFQDAAFMYRYKPSLKEWKVVEARGKAPQMRFAHTAIVHGEEIIVYGGVVVTDELERSGGLAGVEGRAAFVSNEIWTGRLSGGFVHWTNDTPRTCSPHHPAPFDHCGGLHLSGHSSVLVQIGPTSKPVMLVFFGHSPHYGYLHLVQETLIWELYLGDAQASSGGLAGMEGRAAFVSNEIWTGRLSGGFVHWTNDTPRTCSPHHPAPFDHCGGLHLSGHSSVLVQIGPTSKPVMLVFFGHSPHYGYLHLVQEYYIEEKAWGVARTRGWPARGGFAHTAVWDALSGRVYVHAGLVSESEATQAPSAALYEYEVEARIWRPLPSAPTPRYLHTAIFISPGVMLVFGGNAHNDSAAAALTASGASQCYAANALLYYARCRQWMSAGGLLGSPRAGHAAALLPAKRPTVIIHGGFDGRLRSDALVFESGMRCSWYKDETSCMNSARHGVSCVWRLRDMLCVGIKEVGWKDSFTDAVKACIDEPVVVHSACDLCSPDESRCAVSSCEACTALGCAWCGSCLPSAYHCRRSRTAHGPVTLSVSECPPSGASCSRYHSCAACHAHLHRHPHGSEDLNQRACYWDYDTVKCRPANATTDIRGSPSVSGSCSAACSSYTTCGNCTAEECIWCASAGRCVDKNAYGASFPLGGCRAWSTSGCGGVGVTGGVPGGGCSSHVSCRSCLSEPACGWCDDGAGGGRGACLPGGDRHPHHPHICPRRRWHFTSCPSCQCNGHSVCDAASRCVQPCGSRAVGPHCDTCAPAHWGTPLNGGVCTPCECNAQAVSCAADTGRCFCSTKGLAGDRCDKCDNTNHYHADVYNKGCYYDLAVDYQFTFNLSKKEDRHLSAINFRNAPVKPDVDADFSITCSAHARMNLTVRTKSDPERTLFSDVNCTNFRYKVRQFATFLSISIIILVVRVGLYITVHTACDLCSPDESRCAVSSCEACTALGCAWCGSCLPSAYHCRRSRTAHGPVTLSVSECPPSGASCSRYHSCAACHAHLHRHPHGSEDLNQRACYWDYDTVKCRPANATTDIRGSPSVSGSCSAACSSYTTCGNCTAEECIWCASAGRCVDKNAYGASFPLGGCRAWSTSGCGGVGVTGGVPGGGCSSHVSCRSCLSEPACGWCDDGAGGGEEPVCREVTVTPTIPTSVPGDVTSNLRVCDVCPLPARWHFTSCPSCQCNGHSVCDAASRCVQPCGSRAVGPHCDTCAPAHWGTPLNGGVCTPCECNAQAVSCAADTGRCFCSTKGLAGDRCDKCDNTNHYHADVYNKGCYYDLAVDYQFTFNLSKKEDRHLSAINFRNAPVKPDVDADFSITCSAHARMNLTVRTKSDPERTLFSDVNCTNFRYKFAKSEHAFGVEDNVTLTTFFVYVYDFRPPLWIQISFSQYPKLNLQQFFITFSSCFLMLLLVAAALWKMKQKYDLYRRRQRLFVEMEQMASRPFSTVSIELERGGGEGGVPAPVALEPCRWGRAAVLSLVVRLPQGGAGRAPPQGGLALASALVTLGHAHHHDR-
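
The record lists a 5430 bp transcript and an 1809 aa protein, which is consistent with a coding sequence of 1809 codons plus one stop codon (3 × 1810 = 5430). The following is a minimal Python sketch for checking that relationship on FASTA text such as the two records above. It rests on two assumptions not stated in the record: that the transcript is coding sequence only, including the stop codon, and that the trailing "-" in the protein marks the stop. The function names `parse_fasta` and `cds_matches_protein` are hypothetical helpers, not part of any database tooling.

```python
def parse_fasta(text):
    """Return {record_id: sequence}, joining sequence lines that were wrapped."""
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record ID is the first token after ">".
            current = line[1:].split()[0]
            records[current] = []
        elif current is not None:
            records[current].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def cds_matches_protein(nucleotide_seq, protein_seq, stop_symbol="-"):
    """Check that nucleotide length equals 3 * (residues + 1 stop codon).

    Assumption: the nucleotide sequence is CDS only and includes the stop
    codon; a trailing stop_symbol on the protein is not counted as a residue.
    """
    residues = protein_seq.rstrip(stop_symbol)
    return len(nucleotide_seq) == 3 * (len(residues) + 1)


if __name__ == "__main__":
    # Toy records in the same layout as the sequences above.
    toy = ">toy-TA\nATGGTA\nGAATGA\n\n>toy-PA\nMVE-\n"
    seqs = parse_fasta(toy)
    print(cds_matches_protein(seqs["toy-TA"], seqs["toy-PA"]))
```

Joining wrapped lines before measuring length matters here because the nucleotide sequence in this record is split across several lines.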