Monarch geneset OGS2.0

DPOGS207780
Transcript | DPOGS207780-TA | 3246 bp
Protein | DPOGS207780-PA | 1081 aa
Genomic position | DPSCF300042 + 59412-69696
RNAseq coverage | 294x (Rank: top 38%)
Annotation
Heliconius | HMEL017583 | 0.0 | 75.47%
Bombyx | BGIBMGA005477-TA | 0.0 | 77.93%
Drosophila | mew-PB | 0.0 | 42.66%
EBI UniRef50 | UniRef50_Q86G88 | 0.0 | 76.71% | Integrin alpha 1 n=3 Tax=Obtectomera RepID=Q86G88_PSEIC
NCBI RefSeq | XP_625120.2 | 0.0 | 47.67% | PREDICTED: similar to Integrin alpha-PS1 precursor (Position-specific antigen 1 alpha chain) (Protein multiple edematous wings) [Apis mellifera]
NCBI nr blastp | gi|29650237 | 0.0 | 76.71% | integrin alpha 1 [Pseudoplusia includens]
NCBI nr blastx | gi|29650237 | 0.0 | 76.61% | integrin alpha 1 [Pseudoplusia includens]
Group
Gene Ontology | GO:0008305 | 2.1e-55 | integrin complex
Gene Ontology | GO:0007155 | 2.1e-55 | cell adhesion
KEGG pathway | mmu:16404 | 5e-117
K06583 (ITGA7) maps->
    Dilated cardiomyopathy
    Regulation of actin cytoskeleton
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Hypertrophic cardiomyopathy (HCM)
    Focal adhesion
    ECM-receptor interaction
InterPro domain | [431-943] | IPR013649 | 7.4e-111 | Integrin alpha-2
InterPro domain | [212-224] | IPR000413 | 2.1e-55 | Integrin alpha chain
InterPro domain | [274-330] | IPR013519 | 1e-12 | Integrin alpha beta-propellor
InterPro domain | [278-314] | IPR013517 | 3.4e-09 | FG-GAP
Orthology group | MCL11017 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207780-TA
ATGGCCCCCGAAAGTGTTTGCGGCGGACCCCCAACGACGGACCTCGCTACCACTCCACCAAAAACAATATTTACGTTGCTAGTCGGTGCTCCTTTGGGTCAAAATTTGCAGCCAAACACGACAAAGTCCGGGGCGCTCTGGAGGTGCAGAGTGAATTCAGCTCCTAGTGACTGCGAACAAATAGTCACGGACGGGAAACGAAGTCTTGACTCCTTCCATTTGACTGGGCCTCATCCAGATGAAATAAAGGACGGTCAATGGCTCGGCGTGTCCGTCAGGAGCCAGGGTGCGGGGAAAAAGGCAGTCGTCTGTGCACACAGATACATTCGCAAATCTGGAGAATCTCAATTTGGACAAGGGCTCTGTTACACTCTCAGTAATGACCTTCAGCTGATCGACATGTGGGAGCCGTGTCGAGGTCGCTCAGTTCAGAGGGAACACGAAGAATTTGGTTTCTGTCAGGTTGGAACAAGTAGTTCTCTCCTTGAGGATGACACGTTAGTTCTAGGCAGTCCAGGGCCATATACTTGGAGAGGTACCATTTTCACTCAGGATACCAACGATGATTTGCTTGAACGTGATAATGTTGTCTACATGGCGCCAGTTGAAGATGGAGCCAGCCCAGTCGAGAAGTATAGCTATTTAGGTATGTCTGTATCTGGAGCTAATTTCTTCGGCCCGGAGGCATCGTATGCAGCAGGAGCCCCTCGTGCTCATGGTACGGGTCAAGTAGTGCTTTTTAGTAAACGAGTAATTTATGACTGGACCAGAAATGATGTCAATATTCTCAATTTTACTCTGGTCCTCAATGGCGAACAGTTCGGTTCTAGCTTTGGATACGAAGTTGCTTCAGCAGATGTTAACGGCGACGGATTACCAGATCTATTGGTCGGTGCTCCATTCTATTTTTCGCGCGATGCTGGTGGTGCAGTATATGTATACTTAAATGAAAAGCATAATCTTCCTCAAGATTACAGTCTTAAACTAACTGGCAAGCCTGAATCACAATTTGGTATTGCAATTGCTAATACCGGAGACCTCAATAGAGATGGTTGTGAAGATGTAGCCATAGGCAGTCCGTACGAAGGCAATGGCGTTGTGTACATTCATATGGGAGATAGAAAACTTGGTCTTAAAGCGAAACCAGATCAGGTCATAAGAGCGGAATCCTTACCAACGGTTATGAGGACATTTGGCTATTCTCTATCTGGTGGAATGGACCTTGACGAGAATGGATATCCAGATTTATTAGTCGGTGCTTATGAAAATAGTAGCATCGCACTTATCAGAACGAAACCGATAATCGACATAAAGACTTCGATAAAACCATCAAACACTATCATCAATATAGACCCTTCCGTCCAAGGCTGCACAAAAGATCCCAATTCTAATTACACTTGCTTCACTTTCCAAGCGTGCTGTATAATCAAATCTCTGGTACGGTCAACTCAAAGCAACAGTCATGAGTTAAATTACGTCATCGAAGCTGAAACTTTCCCTGGCGGAAGAAAATATTCCAGAGTGTTCTTCGATTCGGAAAAATCTATTATTATCAATAAAACGATACATCTGGGCAAAGACGTTGAAGACTGTAGAGAACATGTCGTGTATTTAAAAAATAATACCAGAGACATTCAGACACCAATCAAATTTCAAGTAACCTACAAGATAGTACAGATCCAACCGAAATACTCAACTATAGGAGAATTACCCAGAATAGATGATTATCCAGTACTGAATGCTACCGCGACATCGTCCTTCAGCGCTAACTTCCTTAACGACTGTGGTTTGGATGGAGTATGCGTTAGTGATCTGGTCGTGGAACCCGTATTACAATTACCTAAAACTGAGGATGGAAAATCATATGCTTTAACCCTTGGTCAGGAAGAGGAAATAAAACTCTCCATATCAGTAGACAATTATGGTGAATCTGCGTATGAAGCCCAACTGTTCGTCCAACACCATGCCAGTTTGCACTACATTGCCGCGAATATTACAGG
CAAGCACATGATATGCACGAGTGTAAACAAGACGACGGTGTCATGCCTTCTTGAGAATCCGTTCAAGAAGCAACAAGACGGTAATCCACCAATAACGATGCGTTTCGATGCGAGAGCGTTGGAAGATAACGAGCCCTCTGTTATTTTCACTGTTTGGGCAAATTCAACTTCAAAAGCATTACATCCGGAAAAGAAACCGGCACAGGTAGCAGCTCTGGTCATCAAAAACGCTGAATTGCAAATTAAAGGCGCCGCACGACCGGAGCAAGTGTTCTACGGCGGCGAGGTGAAGGGAGAATCTGCCATGACTTACTTTGATGATATCGGCACGAGAGTCGTGCACACATATCAGATATTCAATGAAGGTCCCTGGCGAGTGTCTTTGGTACAGGTTGTGATATCATGGCCTCATCAGGTTGCATCCGAAAACTCGCAGGGGAAATGGTTGCTGTATCCCGAAGATATTCCAACTGTTGATGGAGAAGCAGGTCAAAGTAATGGAAACTGTTTTATATCGGGTAACGAGATCAATCCCCTAAAACTTACATCACGACCGGGTGGGACTGAGCCATATCTGGAGAACCTTGAAATGGATCCCTTCTTAAGTTCTGGCAAACTCAGCGTCAGTTCCAGCGAAAAAAGTCACAACTTCACTAAAGTCCAGAATCAAATTTTCGGTTATTCTACCGGTGCTTCGAAAAATGTAAGAAGAAAAAGACAAAGCGATATTGTTAAGGCCGAAGCTTACACTGACAAGGATGGACAAAGGAAACACGTTGTAAATATGAACTGTCAAAAGAAGAGCGCGAAATGCATACAGTTTCAATGCGTTATATACCGCTTGGGTCGCATGCAGGCGACCACGATCACAGTCCGAGCTCGTCTCTGGAACAGCACGCTGGTGGAGGACTACCCTCGGGTCAGTCACGTCAACATCGCCTCCACCGCCTACATACACATACCAGACAATAACATACACCAGAACAAGATACACGATGACGTCGCTACTGTGGAAACAGTTGCCTATCCTGATTTGAAGATCAGTGAACCTTCGGAAGTGCCTTTATGGGTGATCATAGTGTCTGTAATAGTTGGTTTAATAGTTCTAATTCTTCTCATAATTGCTTTATGGAAACTTGGTTTCTTCAAGAGAAGTCGTCCGGATCCAACACTGTCCGGTAACTTAGAAAAGAATAATCACGAGTCCAGCCCTTTCATTGGTCGAGACAGAACCAGTATTCGATAG

Protein sequence:

>DPOGS207780-PA
MAPESVCGGPPTTDLATTPPKTIFTLLVGAPLGQNLQPNTTKSGALWRCRVNSAPSDCEQIVTDGKRSLDSFHLTGPHPDEIKDGQWLGVSVRSQGAGKKAVVCAHRYIRKSGESQFGQGLCYTLSNDLQLIDMWEPCRGRSVQREHEEFGFCQVGTSSSLLEDDTLVLGSPGPYTWRGTIFTQDTNDDLLERDNVVYMAPVEDGASPVEKYSYLGMSVSGANFFGPEASYAAGAPRAHGTGQVVLFSKRVIYDWTRNDVNILNFTLVLNGEQFGSSFGYEVASADVNGDGLPDLLVGAPFYFSRDAGGAVYVYLNEKHNLPQDYSLKLTGKPESQFGIAIANTGDLNRDGCEDVAIGSPYEGNGVVYIHMGDRKLGLKAKPDQVIRAESLPTVMRTFGYSLSGGMDLDENGYPDLLVGAYENSSIALIRTKPIIDIKTSIKPSNTIINIDPSVQGCTKDPNSNYTCFTFQACCIIKSLVRSTQSNSHELNYVIEAETFPGGRKYSRVFFDSEKSIIINKTIHLGKDVEDCREHVVYLKNNTRDIQTPIKFQVTYKIVQIQPKYSTIGELPRIDDYPVLNATATSSFSANFLNDCGLDGVCVSDLVVEPVLQLPKTEDGKSYALTLGQEEEIKLSISVDNYGESAYEAQLFVQHHASLHYIAANITGKHMICTSVNKTTVSCLLENPFKKQQDGNPPITMRFDARALEDNEPSVIFTVWANSTSKALHPEKKPAQVAALVIKNAELQIKGAARPEQVFYGGEVKGESAMTYFDDIGTRVVHTYQIFNEGPWRVSLVQVVISWPHQVASENSQGKWLLYPEDIPTVDGEAGQSNGNCFISGNEINPLKLTSRPGGTEPYLENLEMDPFLSSGKLSVSSSEKSHNFTKVQNQIFGYSTGASKNVRRKRQSDIVKAEAYTDKDGQRKHVVNMNCQKKSAKCIQFQCVIYRLGRMQATTITVRARLWNSTLVEDYPRVSHVNIASTAYIHIPDNNIHQNKIHDDVATVETVAYPDLKISEPSEVPLWVIIVSVIVGLIVLILLIIALWKLGFFKRSRPDPTLSGNLEKNNHESSPFIGRDRTSIR-
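
Consistency check (sketch):

The transcript and protein lengths in this record are related by simple arithmetic: a 3246 bp coding sequence holds 3246 / 3 = 1082 codons, and dropping the terminal stop codon leaves 1081 aa. The short Python sketch below illustrates that relationship and translates the first and last codons of DPOGS207780-TA. It assumes the standard genetic code and writes the stop codon as "-", as the protein record above does. The function names translate and expected_protein_length are hypothetical helpers for illustration only and are not part of the geneset or of any tool used to build it.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here;
# the record itself does not state which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as stop_symbol; trailing bases that do not
    fill a complete codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def expected_protein_length(cds_bp: int) -> int:
    """Residue count for a CDS of cds_bp bases that ends in one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# First seven codons and last three codons of DPOGS207780-TA.
start = translate("ATGGCCCCCGAAAGTGTTTGC")
end = translate("ATTCGATAG")
```

Here start corresponds to the first residues of DPOGS207780-PA and end to its last residues plus the stop marker, and expected_protein_length(3246) gives the 1081 aa listed for the protein.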