Monarch geneset OGS2.0

DPOGS201843
Transcript        DPOGS201843-TA  2610 bp
Protein           DPOGS201843-PA  869 aa
Genomic position  DPSCF300191 - 366501-375420
RNAseq coverage   46x (Rank: top 71%)
Annotation
Heliconius        HMEL003937        1e-113  49.66%
Bombyx            BGIBMGA006095-TA  0.0     61.84%
Drosophila        wb-PB             7e-133  34.97%
EBI UniRef50      UniRef50_D6WJN1   8e-135  37.16%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WJN1_TRICA
NCBI RefSeq       XP_624587.2       2e-139  35.67%  PREDICTED: similar to wing blister CG15288-PB, isoform B [Apis mellifera]
NCBI nr blastp    gi|328792763      4e-138  35.67%  PREDICTED: laminin subunit alpha-1-like [Apis mellifera]
NCBI nr blastx    gi|328792763      2e-161  35.67%  PREDICTED: laminin subunit alpha-1-like [Apis mellifera]
Group
Gene Ontology     GO:0031012  1e-07  extracellular matrix
                  GO:0007155  1e-07  cell adhesion
KEGG pathway      gga:421709  8e-103
                  K06239 (LAMA2) maps to:
                      Amoebiasis
                      Viral myocarditis
                      Arrhythmogenic right ventricular cardiomyopathy (ARVC)
                      ECM-receptor interaction
                      Pathways in cancer
                      Small cell lung cancer
                      Dilated cardiomyopathy
                      Focal adhesion
                      Hypertrophic cardiomyopathy (HCM)
InterPro domain   [590-632]  IPR002049  3.1e-09  EGF-like, laminin
                  [432-543]  IPR000034  1e-07    Laminin B type IV
Orthology group   MCL25207  Lepidoptera specific

Nucleotide sequence:

>DPOGS201843-TA
ATGGGTCCTCACACGTTCATTAGTCCCGCCATACATGGCGTTGATTCCATCTCTTCCAAGTATCGCAACGAGCAGAGTATAAAGAAAATGGAGCTTCTTCACGTCATTATCAAAGCGGGTCCGTCGCCCCGTCCCCTGGCCTGGTCTTTAGAAGCCTCTACAACAGAGAATGGCGACGACTGGACGTTGATACGAGCGTTTGGAGATCGAGATCACTGTCGGAAGCTATGGGACCTTCGGCCTGAACGAAGACGGAGAAAAGCTCGGGCGGCGAAGAGAGCTGGTCGGGCCGAGAAACCTTCTTGCTCGACACAGTTCACTAGTCCACGTCCTTTGGAAAACGGAGAGATGCATGTGGGTATGGGTGAAGGGGTGACAGCTCGCCGGGTGCGAATCTCGTTTCGCGCAGCTCACTCCCCTCTGAGGCATCAGTATTATACAGTCAGAGAACTCACCCTGGCAGCGAGGTGTCTGTGCCATGGACACGCTGACCACTGCATTGTGGACAATAAAAGTGCAAAATGTTCATGTCTGCATGGCACATGCGGCGCCCACTGTCAACGTTGCTGTTCGGGAGCACCCTGGCACGATGGTCAATCCTGTGACATAACACAGGAGAAGAATGAGTGTTCCTGTGGAGAAAGAGGGGCCTGCTCATATGACGATACTGGCGCTATATTGTGCGTTAACTGTACAGATAACAGAGCCGGACCGCTATGTAATCGTTGTCTCTTCGGTTACTACAATGTCTTCCCTGACGACCCTTGCCAACCTTGCGATTGCGATCCTGAAGGCTCAGATGGCACCTGTAAGTGGAATAAAAAACTTCATCAAATTGTCTGCACCTGTAATCCAGGATTCACCGGCCTTCAGTGTGAAAAGTGTGAAAATCCAAACGCAGTATTCCCAAATTGTTTAGTGGAGGTGACCACGGCAACATGTAAATGTGATCCGCGAGGTATTGTTGACCCTGGCAGGGTGTGCGATGACATTTGCGAGTGCAAGGTCAATGTGATCGGAGAGAAATGTGATACTTGTGCCCCGGGACACTTCGGGTTGAGTTCATCTCTTCCGGGAGGCTGTCGAGCTTGCTACTGCTCGCACGTCGCCTCTCATTGTGAATCAGATCCAAGACCCGGCCCGGATATCGCGTTTCCTCTGGGTAACGAGTGGATGATAAGTGACAACAACAGTTCAGAGACGTTAGAACCTTCCGTAGACGACAAGGGAAAACCTTACCTTATAAGCTATGAGGTGGAGGGTTGGGAGTCATATTACTTCGTCACTTCGTACCTCACTGGGCAGCAGTTGTCTTTGTATGGCGGAACGCTACTGTCTCAATTAGCTTGGGGTATCGCTCGAGGAGACTCGGGCGGAAGCGCAAGTCGAGCTCCTGATTTTATACTCGTTGGAAACGATGGAATTAAGCTGACTTACGGCAACACAAGCTATGAAACGCCTGGTCTGATAGAATTAGTGGCTCCTCTTGAAGAGGGAGGGTGGTATTTGAATTCGGAGGTGGCTTCACGGTCGCAGCTTATGGACGTAATGAGTAACTTAAAGTCAGTCATGATTCGAGCACACTTTCATTTTGATCAGGATGAGGTTCGTTTGGAGCGGGCTCATATACGAGGTTCCGAGGGTGGTGTAAGAGAACTTTGTACGTGTGCTCCTCAACACGCGGGGACACAGTGCGAGACTTGTGCGCCGGGACACGTGCGACTAGAGCGAGCTCCCGGAACTTCCCCCGCCTTTGAGTGCGTGCCGTGTGACTGCAACATGCACGCTGATTGTGACTCTGTAAATGGTCCTTGTGGCCCCTGCCAACATAATACCACCGGACCGCATTGCGAACGATGTCTCCCAGGACATTACGGTAACCCCGTCCAAGGTGCTTGTAAACCATGCGCATGTCCTCTGTATGAGCCGTCTAATAACTTCAGTCCTAACTGCGCCTTGGCGGCGGCTGTTGGTGATGATTATGTGTGCACACAATGCCC
GGATGGATACGCTGGAGATCACTGCGAGATATGCGACTCGGGCTACTGGGGTTCGCCGGCTACAGTGGGAGGCTCGTGTCAGCCCTGTGATTGCGGTGGAGGTCCGTGTCATCCGCAGTCGGGCGCCTGCCTCGTGTGTCCGCCACATACAGAGGGGGAGCGCTGCGATCAGTGCCAGGAGGGCTATTGGTCCGGTGGAGACAATGGGTCCTGTATTGCATGTGGATGTGGATTGGGAGCGCTGTCAACGGCGTGTGACCCTCGCGCTGGGCACTGCGCCTGCGCACCTGGCTGGACGGGAAGAGCCTGTAATGTCTGCGCCACAGGACACGGCAACGTGTCGGCGGGCTGTCCCCTGTGTGCGTGCGGCGCGGGTTCGTCAAGTGCAATGTGTGGGACGGAGCACGGGGAGTGTCCCTGTATGTCCGGGGCCGCTCCGCCTCGATGCGACACGTGTCTAGAACAGCACTACAACCTGTCCACTACTGGCTGTTTGAGCTACCTTCAATACCAGGTTCAAAGTGGCATGCGGACTTTGCAATATGAAGTCCTCATCCACCTCAGCGCCACATTTTTGATAGTTTCAATCCACAACAAATTATTGCAGTGA

Protein sequence:

>DPOGS201843-PA
MGPHTFISPAIHGVDSISSKYRNEQSIKKMELLHVIIKAGPSPRPLAWSLEASTTENGDDWTLIRAFGDRDHCRKLWDLRPERRRRKARAAKRAGRAEKPSCSTQFTSPRPLENGEMHVGMGEGVTARRVRISFRAAHSPLRHQYYTVRELTLAARCLCHGHADHCIVDNKSAKCSCLHGTCGAHCQRCCSGAPWHDGQSCDITQEKNECSCGERGACSYDDTGAILCVNCTDNRAGPLCNRCLFGYYNVFPDDPCQPCDCDPEGSDGTCKWNKKLHQIVCTCNPGFTGLQCEKCENPNAVFPNCLVEVTTATCKCDPRGIVDPGRVCDDICECKVNVIGEKCDTCAPGHFGLSSSLPGGCRACYCSHVASHCESDPRPGPDIAFPLGNEWMISDNNSSETLEPSVDDKGKPYLISYEVEGWESYYFVTSYLTGQQLSLYGGTLLSQLAWGIARGDSGGSASRAPDFILVGNDGIKLTYGNTSYETPGLIELVAPLEEGGWYLNSEVASRSQLMDVMSNLKSVMIRAHFHFDQDEVRLERAHIRGSEGGVRELCTCAPQHAGTQCETCAPGHVRLERAPGTSPAFECVPCDCNMHADCDSVNGPCGPCQHNTTGPHCERCLPGHYGNPVQGACKPCACPLYEPSNNFSPNCALAAAVGDDYVCTQCPDGYAGDHCEICDSGYWGSPATVGGSCQPCDCGGGPCHPQSGACLVCPPHTEGERCDQCQEGYWSGGDNGSCIACGCGLGALSTACDPRAGHCACAPGWTGRACNVCATGHGNVSAGCPLCACGAGSSSAMCGTEHGECPCMSGAAPPRCDTCLEQHYNLSTTGCLSYLQYQVQSGMRTLQYEVLIHLSATFLIVSIHNKLLQ-