Monarch geneset OGS2.0

DPOGS200296
Transcript: DPOGS200296-TA (4359 bp)
Protein: DPOGS200296-PA (1452 aa)
Genomic position: DPSCF300026 - 409412-442267
RNAseq coverage: 659x (Rank: top 19%)
Annotation
Heliconius: HMEL000053; 0.0; 88.32%
Bombyx: BGIBMGA005570-TA; 7e-166; 86.67%
Drosophila: l(2)gl-PI; 0.0; 46.27%
EBI UniRef50: UniRef50_D0ABA2; 0.0; 84.22%; Putative lethal (2) giant larvae n=3 Tax=Nymphalidae RepID=D0ABA2_9NEOP
NCBI RefSeq: XP_966843.1; 0.0; 58.11%; PREDICTED: similar to lethal giant larva, putative [Tribolium castaneum]
NCBI nr blastp: gi|261335917; 0.0; 84.22%; putative lethal (2) giant larvae [Heliconius melpomene]
NCBI nr blastx: gi|261335917; 0.0; 84.22%; putative lethal (2) giant larvae [Heliconius melpomene]
Group
Gene Ontology: GO:0005515; 3.3e-34; protein binding
KEGG pathway: tca:655235; 0.0
    K06094 (LLGL) maps -> Tight junction
InterPro domain: [22-40] IPR000664; 2.9e-51; Lethal(2) giant larvae protein
    [36-544] IPR011046; 3.3e-34; WD40 repeat-like-containing domain
    [435-468] IPR015943; 4.4e-25; WD40/YVTN repeat-like-containing domain
    [267-384] IPR013577; 1e-24; Lethal giant larvae homologue 2
Orthology group: MCL11002; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200296-TA
ATGGCCGGCAGAATGCTAAAGTTTATACGAGGGAAGGGCCAGCAGCCCTCTGCAGAGAGACAGAAACTGCAAAATGAACTATTCGCCTTCCGTAAGACAGTCCAACATGGCTTCCCTCACAGGGCGTCGGCGCTAGCCTGGGACCCGTTACTCCGCCTGGCAGCCCTCGGCACAGCGACGGGAGCATTGAAAGTGTACGGAAGACCAGGCGTCGAGCTCTACGGACAGCACACCAATCCGGAAGTTGTAGTCACCCAGATACATTTCATACCAGGAACAGGACGGCTCATATCACTCTGCGATGATAACAGTCTCCATTTGTGGGAGATAAACGAGAAATCTCTAGTGGAACTCAAATCACATTCCTTCGAAGGAAAAAATAAGAAAATATCATCAATATGCGTCGAATCTTCTGGAAACAACCTCCTGCTTGGAACCGAGGGCGGAAATATATACTCCTTAGACTGCAACGCTTTCACAATACACGAAGACGTTATTTATCAGGACACTGTTATGCAAAACTGTCCAGAAGACTTCAAGGTCAATCCAGGTGCAGTGGAATCTATATGCGAACACCCGAAGACACCAAGTCGATTACTAATAGGATACAACCGAGGGCTTGTGGTGTTATGGGACAGAGCGACCTCAGCGCCCTCACACACCTTCGTCTCCAACCAACAGCTGGAGAGCTTGTGCTGGAACGATGAAGGAGAACACTTCACGTCGTCTCACAACGACGGTTCGTATGTGACGTGGGAGGTGGCGGGCGCCGCCAGTGATCGCCCGCTGAAGGAACCCGTGACTCCTTATGGACCTTACCCCTGTAAGGCCATCTCCAAAATACTGAATAGGACCAGCGTGGGAGGAGACGAGCTTATCATATTTGCAGGTGGTATGCCGAGAGCTTCATATTCGGACAAGTACACAGTGACCGTCCAGCAAGGAGAAAAACACGTCGCTTTCGACTTTACCAGCCGTGTGATAGACTTCTTTACAACAACTCCCGTGCCGCCCGACGGCGTCCCTCTACAGGAGGAGCGCCCTACCATCCCCGCCGTCATACAAGGACAGGCAGTGAACCAAGTTGCCGCTACTCTGGTGGTGTTAGCTGAAGAGGAGCTGGTGGTGTTAGATCTGTGTGATGAGAGATGGCGTCCGCTAAGACTGCCGTACCTGGTGTCGGTACACGCCTCCGCCATCACAACCGCTTACCTCGTCGACAACGTCGCCGACAATGTTTACGATAACATTGTGCAAGCGGGTCAACAACAAACCGAAAACATTTACTCCGAGAGCCAGTGGCCTATCTCCGGAGGGATAGTAGAAACGGCCGAACAGACAGACAAACAGATCCTGCTCACCGGCCACGAAGACGGCTCGGTCCGCTTCTGGGACGTGACCGGCGTCGTCATGACGCCTCTATATAAATACACAACCGCCCCACTCTTCAGCGGCGAAGAGATATGTGAAAATAACGACAGTCAAAACGACGAGGAAGAGTGGCCGCCGTTCAGGAGAGTCGGCACATTTGACCCGTACAGTGACGACCCGCGACTGGCTGTGAAAAGAGTCCTGCTCTGTCCGCTGTCTGGTATGCTAACTATCGGTGGCGCGGCTGGCCACATTGTTATAGCGAGTCTGAAGTCCACACCAAGCACGGCGGAAGTAAAATCCGTATCCGTGAACATCGTGTCTGACAGAGACGGCTTCGTATGGAAGGGTCACGATCAACTAAACCTCCGTTCCGGACCGCTCACGTTCCCCGCCGGCTATCAAGCGAGCGCTGTGTGCCAGCTGTCCCCGCCGGCCGCGGTGACAGCCCTGTCCGCGCAGTGGGAGTGGGGGGTGGTGGTGGCGGGCACGGCGCACGGCCTGGCACTCATCGACGCGCTCAAACCCGCGCCTCTCACACACAAGTGCACTCTCAACGCACACGGCCACTCCGGGGCGGGCGATACTCCTATTTCAAGAAGGAAATCATTCAAGAAATCACTCAGAGA
ATCCTTCCGAAGACTCAGGAAGGGGAGATCCCAGAGACGGCAGACAACTACTTCAGCCAGTCCCACATCACCCACGCAGTTCTATAAACTGCCAACACAGCCAGCTTCAAAAAAGATAACGGAAAAGATAAATGAAGCGGACGCTGACGTCAAACCCATCGAAAGAGCCGTGGAAGCGAGGTCCACTGACGACGCCTTCGGCACCATGGTGCGCTGTCTCTACTTCGCGAGGACCTTCCTTGTCAGCACTCAGAACTCGACGCCCACATTATGGGCCGGCACTAACAATGGCACGGTTTACGCATTCACGATACACGTACCCAATACCAACAAGAGGAAAGAGGAACCCGTAACATGTCAACTAGCTAAGGAGATCCAGCTGAAGCATCGAGCCCCTGTGATAGGTATAACAGTGCTGGACGGAGCGTCCGTGCCGCTACCGGATCCTTTGGAGGTGGAGCGTGGCGTCGCACCTCTGCCGGAGGCCGGGCCTCAGCGAGTCCTCATCACGTCGGAGGAGCAGTTCAAAGTGTTCACGCTGCCGTCACTGAAGCCGCACAACAAATACAAGCTGACAGCACACGAGGGCGCCAGGGTGCGGCGCACAGCGTTCGCGTGGTTCGAGTGCGGCGGAGGGTCGGAGCGGCACCGGGAGTGGTGTCTGCTGTGTCTCACCAACCTGGGAGACTGTCTCGTGCTATCTCCTGACCTCAGGAGACAACTGAACGCGGCCGCCGTGCGCAAGGAAGACATTAATGGTATTTCCAGTCTGTGCTTCTCTAAGCGAGGCGAGGCTCTCTACTTGCACTCCTCGTCCGAGTTGCAGAGGATAACGCTCTCCGCGACTAAGGTGACCATAGCTCAATGCCACGTGTTGCTATCTCCATGGGCCGCCGCCCTCCGCGGGCCGGCGGACGAGGCGCCGCTCACTAACGGCGAACACAAAGTGAGCCACTCCGGGGCGGGCGATACTCCTATTTCAAGAAGGAAATCATTCAAGAAATCACTCAGAGAATCCTTCCGAAGACTCAGGAAGGGGAGATCCCAGAGACGGCAGACAACTACTTCAGCCAGTCCCACATCACCCACGCAGTTATATAAACTGCCAACACAGCCGGCTTCAAAAAAGATAACAGAAAAGATAAATGAAGCGGACGCTGACGTCAAACCCATCGAAAGAGCCGTGGAAGCGAGGTCCACTGACGACGCCTTCGGCACCATGGTGCGCTGTCTCTACTTCGCGAGGACCTTCCTTGTCAGCACTCAGAACTCGACGCCCACATTATGGGCCGGCACTAACAATGGCACGGTTTACGCATTCACGATACACGTACCCAATACCAACAAGAGGAAAGAGGAACCCGTAACATGTCAACTAGCTAAGGAGATACAGCTGAAGCATCGAGCCCCTGTGATAGGTATAACAGTGCTGGACGGAGCGTCCGTGCCGCTACCGGATCCTTTGGAGGTGGAGCGTGGCGTCGCACCTCTGCCGGAGGCCGGGCCTCAGCGAGTCCTCATCACGTCGGAGGAGCAGTTCAAAGTGTTCACGCTGCCGTCACTGAAGCCGCACAACAAATACAAGCTGACAGCACACGAGGGCGCCAGGGTGCGGCGCACGGCGTTCGCGTGGTTCGAGTGCGGCGGAGGGTCGGAGCGGCACCGGGAGTGGTGTCTGCTGTGTCTCACCAACCTGGGAGACTGTCTCGTGCTATCTCCTGACCTCAGGAGACAACTGAACGCGGCCGCCGTGCGCAAGGAAGACATTAACGGTATTTCCAGTCTGTGCTTCTCTAAGCGAGGCGAGGCTCTCTACTTACACTCCTCGTCCGAGTTGCAGAGGATAACGCTCTCCGCGACTAAGGTGACCATAGCTCAATGCCACGTGTTGCTGTCTCCATGGGCCGCCGCCCTCCGCGGGCCGGCGGACGAGGCGCCGCTCACTAACGGTGAACACAAAGAAGAACCGTCTGAAGCGCCCCACGATGTGACAGCAGCTT
CGGCGGACATCACCGTCGACTCTGTCAGAGATCACACGTCACAGGACAACGCCACGGGGGACTTGAATATTAACTTGCAGAATTCTCAAGTGAACACAACGTCTATGGTTGTTAAGACGACCACGCGGACCACCGTCAACGAAAACAACGCGGACGGACAGGCGGTCACCACCACCACCACCACCACAACAAACTCCACGAACGAGAACATCCTAGAGCACAGTCGCGAGGAAGGAGTCATCACCCGTATCGAGACGGGCACCGTGACGGTCCCAGCCGGCACCGACCCCAAGCTGATACTGGAGATGTTTGACCGACAGCGGTCGCCGCTCGCCGTGCCCACCCCCGCCGACACATAG

Protein sequence:

>DPOGS200296-PA
MAGRMLKFIRGKGQQPSAERQKLQNELFAFRKTVQHGFPHRASALAWDPLLRLAALGTATGALKVYGRPGVELYGQHTNPEVVVTQIHFIPGTGRLISLCDDNSLHLWEINEKSLVELKSHSFEGKNKKISSICVESSGNNLLLGTEGGNIYSLDCNAFTIHEDVIYQDTVMQNCPEDFKVNPGAVESICEHPKTPSRLLIGYNRGLVVLWDRATSAPSHTFVSNQQLESLCWNDEGEHFTSSHNDGSYVTWEVAGAASDRPLKEPVTPYGPYPCKAISKILNRTSVGGDELIIFAGGMPRASYSDKYTVTVQQGEKHVAFDFTSRVIDFFTTTPVPPDGVPLQEERPTIPAVIQGQAVNQVAATLVVLAEEELVVLDLCDERWRPLRLPYLVSVHASAITTAYLVDNVADNVYDNIVQAGQQQTENIYSESQWPISGGIVETAEQTDKQILLTGHEDGSVRFWDVTGVVMTPLYKYTTAPLFSGEEICENNDSQNDEEEWPPFRRVGTFDPYSDDPRLAVKRVLLCPLSGMLTIGGAAGHIVIASLKSTPSTAEVKSVSVNIVSDRDGFVWKGHDQLNLRSGPLTFPAGYQASAVCQLSPPAAVTALSAQWEWGVVVAGTAHGLALIDALKPAPLTHKCTLNAHGHSGAGDTPISRRKSFKKSLRESFRRLRKGRSQRRQTTTSASPTSPTQFYKLPTQPASKKITEKINEADADVKPIERAVEARSTDDAFGTMVRCLYFARTFLVSTQNSTPTLWAGTNNGTVYAFTIHVPNTNKRKEEPVTCQLAKEIQLKHRAPVIGITVLDGASVPLPDPLEVERGVAPLPEAGPQRVLITSEEQFKVFTLPSLKPHNKYKLTAHEGARVRRTAFAWFECGGGSERHREWCLLCLTNLGDCLVLSPDLRRQLNAAAVRKEDINGISSLCFSKRGEALYLHSSSELQRITLSATKVTIAQCHVLLSPWAAALRGPADEAPLTNGEHKVSHSGAGDTPISRRKSFKKSLRESFRRLRKGRSQRRQTTTSASPTSPTQLYKLPTQPASKKITEKINEADADVKPIERAVEARSTDDAFGTMVRCLYFARTFLVSTQNSTPTLWAGTNNGTVYAFTIHVPNTNKRKEEPVTCQLAKEIQLKHRAPVIGITVLDGASVPLPDPLEVERGVAPLPEAGPQRVLITSEEQFKVFTLPSLKPHNKYKLTAHEGARVRRTAFAWFECGGGSERHREWCLLCLTNLGDCLVLSPDLRRQLNAAAVRKEDINGISSLCFSKRGEALYLHSSSELQRITLSATKVTIAQCHVLLSPWAAALRGPADEAPLTNGEHKEEPSEAPHDVTAASADITVDSVRDHTSQDNATGDLNINLQNSQVNTTSMVVKTTTRTTVNENNADGQAVTTTTTTTTNSTNENILEHSREEGVITRIETGTVTVPAGTDPKLILEMFDRQRSPLAVPTPADT-
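
Length check:

The lengths listed for this record are consistent with a complete coding sequence: 4359 bp = 3 x (1452 residues + 1 stop codon). The Python sketch below shows one way to run that check on FASTA text. It assumes the whole transcript is coding sequence and that a trailing "-" (or "*") in the protein record marks the stop codon. The helper names `read_fasta` and `cds_matches_protein` are hypothetical and not part of any tool associated with this geneset, and the records in the example are toy data, not the sequences above.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def cds_matches_protein(cds, protein, stop_chars="-*"):
    """Return True if the CDS length equals 3 nt per residue, plus one
    codon when the protein string ends with a stop marker (assumption:
    '-' or '*' at the end denotes the stop codon)."""
    residues = protein.rstrip(stop_chars)
    has_stop = len(residues) < len(protein)
    expected = 3 * (len(residues) + (1 if has_stop else 0))
    return len(cds) == expected


# Toy records (hypothetical), with the nucleotide sequence wrapped over two lines.
toy = """>toy-TA
ATGGCC
GGCTAG
>toy-PA
MAG-
"""
records = read_fasta(toy)
print(cds_matches_protein(records["toy-TA"], records["toy-PA"]))  # True

# Lengths reported for DPOGS200296: 1452 residues plus one stop codon.
print(3 * (1452 + 1))  # 4359
```

To apply the same check to this record, pass the full text of the two FASTA records to `read_fasta` and compare the `DPOGS200296-TA` and `DPOGS200296-PA` entries.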