Monarch geneset OGS2.0

DPOGS209596
TranscriptDPOGS209596-TA4548 bp
ProteinDPOGS209596-PA1515 aa
Genomic positionDPSCF300015 - 529707-577290
RNAseq coverage1551x (Rank: top 8%)
Annotation
HeliconiusHMEL0170070.066.23% 
BombyxBGIBMGA006655-TA0.055.76% 
Drosophilaif-PD6e-17140.59% 
EBI UniRef50UniRef50_Q1G0S60.070.95%Hemocyte-specific integrin alpha subunit 2 n=2 Tax=Obtectomera RepID=Q1G0S6_MANSE
NCBI RefSeqXP_311131.40.043.49%AGAP000032-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|296502390.072.96%integrin alpha 2 [Pseudoplusia includens]
NCBI nr blastxgi|989625040.059.34%hemocyte-specific integrin alpha subunit 2 [Manduca sexta]
Group
Gene OntologyGO:00083051.2e-50integrin complex
GO:00071551.2e-50cell adhesion
KEGG pathwaymdo:1000270904e-120 
 K06584 (ITGA8)maps-> Dilated cardiomyopathy
    Regulation of actin cytoskeleton
    Cell adhesion molecules (CAMs)
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Hypertrophic cardiomyopathy (HCM)
    Focal adhesion
    ECM-receptor interaction
InterPro domain[475-828] IPR0136494.7e-84Integrin alpha-2
[260-272] IPR0004131.2e-50Integrin alpha chain
[375-431] IPR0135194.6e-13Integrin alpha beta-propellor
[316-355] IPR0135176.4e-07FG-GAP
Orthology groupMCL10472 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209596-TA
ATGGCGCTATTTTTGTGCGTTTCCTTATGCGCCTGGGCAGGAGCCTTAGGGTTCAATGTGGACATCCCTTCTAGGGTGGTTTATACGGGAAATCATCAGTCTATGTTTGGATTTACTGTCCAGAGTCATGTGGACGGTGATAGAAAGATGATACTGGTGGGGGCTCCAGAGGACGAGCCCTATCGCGTGGAGAACGTTCAACGTCCGGGCGCCGTGTACCGCTGCCAACCATCACATCGCGGCTACACCTCCGAGAATGCCAACAACCATCTCCAGATGTGCTCCTTGATAGAGTTTGACAAGAATAGATATAACAACATGGATACCGCTCGTCGTCAGATAGACCAGAAGTCCAACCAGTGGTTCGGGGCCACTCTCACCAGCACGGGCCGAAACGGACCCATTGTGGCTTGCGCTCCGCGTTACGTGTCCTTCGTGTCGGCAAAGTTGAACCAAAGAGACCCTGTGGGGTCCTGCTTCGTAGCTCAGACCCCAGATGCCGTGGAAGTCAATGAGTTCTCACCCTGTCGAAACTCGAGCCACGACCAGCGGAAGACTGGCATGTGTCAGGCCGGTTTTTCAGCAGCTGTCTCCAAGGATGGCCAAAGACTATTCTTAGGAGCTCCCGGCAGTTTTTATTGGCAAGGACGAACGTTTTCCATAGACACTAGCGCAAGGTTCAACTACTTTATGCCGAAGGAAGGTCAACTCAAAGTTCAAACCTTGAATTCGAAGCCGGTATTAATTTCGACTCCGGAAGACAGCCCCAAACATGACGACTCGTACATGGGATATTCAGTGACAGTCGGAGACTTTGCTGGTCAAGGAATACAGGGTGTTGCTGTAGGTGTTCCCAGAGGATCGGAGTTAAGAGGATTAGTGGTCCTCTATACGTGGGAGTTACAAATTATTAAAAACATCAGTGGTTCCCAAATCGGAGCCTACTTCGGCTACAGTATGGCTAGTGGTGACATCGACGGCGACGGCGCTGACGACATCATCGTCGGAGCTCCAATGTTCACCAAGACCAAGAGTGACGGATATGAGCATGGAAGAATCTATGTCATTTATCAGGGAAGCGATAGATCGTTCCAAAAGAACCACGCCAGAACTGGGGAGGTCTCAAAAGGAAGATTCGGATTGGCCGTCACGTCACTTGGGGACATCAATTATGATGGTTTTGGAGATATCGCTGTTGGTGCTCCGTATGGCGGCGAGAGTGGAAGAGGTGTGGTGTACATCTATCACGGCAGTGAGCTCGGCATCGGTGAGAAGTATTCCCAGGCTATTACTGCTGAAGAGATCAGTCCCGTCCTCACTACCTTTGGCTTTTCACTCTCGGGAGGGGTCGACTTGGACAACAACAACTATACGGATCTGGCTGTTGGTGCTTATAAATCAGACAGTGTGGTGTTTTTGAAGTCCCGTCCGGTGGTCAAAGTGATGGCGGAGGTGCGGTTCATGGGCGAGAGTAAACTTATCTCGCTGGACGACCGTCGCTGCGTCCTCTCCAACGGGACAAACGTCGCGTGCGCGCGACTTATGTTCTGTCTCACATACGATGGGGTTAACGTGGACCAACAAATTAATTTCGAAGTAACTTTGGATCTGGACTCTCGCCAGAAGACAAGCAAGCGTCTGTTCCTAATGGAGTCGAGGCAAACAGTTTACACGACTCACATATTACTTAATCAGGGGCAACAGGAATGCAAGGAAATCACTGTCTATTTAGATGAGGAAATTCGTGACAAGCTGACACCCTTGGAGGTGAAGATGACGTATGAGCTAGTACAGCCCGGAGGGTCTGCGGGAGCCTCGGGGGCGGTCCCACCAGTATTGGACCGCACCCTCGGCGTCGAGCGGACCGACTCACTCAATATACAGAAGAACTGCGGACCTGACAATGTTTGTGTTCCCGATTTGAAGATGTTTGTGACCACGCCCACAGTGAATTACGTTCTGGGATCCGGTGAAAACGTTCACATCGACGTAAAGGTGGACAATGGCGGTGAAGACGCCTTTGAAACAGCCTACTACCTGCAAATACCAGCCGGAGTAACTTACGCCAAGATGGAAAGATTGGACAAGGATACCGCCGAGACATCTCCGATATACTGTTCCATTAAAAATAAAGGAACCAGTGGAAACAGTACTTTGAAATGTGATCTCGGAAATCCCATGGCCAGTGGACAAAGCGTGAACTTCCGAGTTGTATTAGAGGTGGACACGCGAGTGACGGGGCTGAACTTCGACATGGAATCCAACTCCACAAACCCTGAGGTCGGCACGCAGTACGACAACACGAGACACTTGGCCATAGGCGTCGTCATTAAGGCAGTATTGTCGGTCATTGGGACGTCCGACCCTCCAGAACTTCACTACAATGCATCCTTATACGAGTCTCGGGACCTGAAAGATGATACCAAACTCGGTCCCCAGGTCATTCACAAGTACAACATCAAAAACGAAGGCGAGAACCTCATGTACATGGTGGTTCAACCTCAGTGGGCGGGCGACATTGTGTGTGACGTCGCAAGAAACGTTAACCCCGACAACCTGTTTATACAGAACCCATATGCCTACATGTTGACAAAAGAAAAAGAAGAAATGATGAGCAGTGAGTTCTATTCAGGCGCACAGATTGGTGGAGCTGGCGGATACTATGGACAACAATATTGGGAAGAAAAACATACCATTAGTGGTGAACAGGCGGGTCAGACTGGAGCTGGCCAAGTTAATAGAGGGCAAACTGCGTCTAACGCGTTTGGTACACAAACTTCCAGTGGAGGACAGTATGCTGCTACTAGTCAAGGAAATGGTCAATTTAATGGTGAACACTCTATTCATATTAGTGGGGGTCAAGCGACGGGCGGCCAATCGACCTACGTAAATAGAGGTCAAGCATCAAGTGGTCAATATGGCACCCACATTAGCGGTGGAAGAGTTGCAGGCGAACAGTATGGCAGTACCCAAGTATCTGGCACTCGTTACGGAACCCATGTCAGTGGACAAACGGGTGAAATATCTGGTGATCAAATGGGAACACATGTAATTGAAAATAGTGGTTCAGTTTCTAACACATACGGTACCCAGCTCAATAAAAACAATGATCAAATTTCTGGAGGACAATATTATGGTGGTCAAATGGGTGGTGGCGGACAAGGGAGTGGAAGTTCCTACTCTTATAATCGTACATGGTCCAGCGGCTCAAATGAAGGATTATCAGCTGAAGAACAAGAAGCTATACGAAAGAGATTGCAAGCTATGGCACAACCAAATTACCAAGGGTCCGGTCGAGAATATAACGAAGGAGGTCAAAATACCTATTCTCAAAGCAGTTCTCATGGTTCGGCTTATGGTCAAAATGGTTACAGCCAAGGTGGCCAATATTATGGTCGAGAACAAAATCATGGCGGACAAAGTTTTAACATTCAAGGCGGACAAGGAGTGGGATCTGCGGAATATGGCCAAGATGGACGTATAGTTCGAATTAGAAACAAAACTGTCGTGTTCGATTCTAACCATAATGTAATTTCCGAGTCTGAAACGAGTACTGAGTATGGATCATTGGGGCACGAGGGTGAAGCTGGTAGCTCTTTCAACACTTACCTTAATAATCAGGGAGTCAATCAGAATTATGGTTCTAGTCAAGGTTTTGTTCACGGTTCATGGGGTACTACGGAAACCGTTAATCCAAATCTAATGAACGCCGGATCCAAAACCTATAGAAATCCAAGCGTTACTGTCGTGGGCGATGAAGACGAGGACAAACTTGAAGGTTTTGGAGCCTACGCAGCCCAAAACTCTAATAATGAATTTAAGTTTGGTATCGCTGACGTGACTAACCAGGCCGGCTCTAGTGGAGGCGGTGCAGCATACAGATCTGGTTCACAAGTCTACCAGAACGGTCAACAAGGAGCTTTCCAGGGCGGCGTACAAGGAGGAGCGTCTTCTTCATACGGTAACAGTGGAAGTTATTCGTACTCTTCCTCACAATCAGGCGGTCACAGATCATATTCACAGTCAGGACACAGTTTCAGTTCGTCATCGGAACAGAGACGAGAAAGTACTAGAAGAAAAAGACAAGATGAAGTCGATACGGAACTGAAGGAGATGCTGTCCCGCTGCGAAGAGAAGTACAAGTGCTCAGTGGTGCGGTGTCACACGGGCCGCCTTGTGAAGGGACAGGCTGTGTGGGTCGCGCTACGGTCGCGAATCAACTCCTCTGTACTCAATGAAATCTCTCAGGAACGCTCCGTGAAACTGTCATCGCTGGGCGCGGCTCGCGTGTCCCGTCTGCCTCTGGTGGGTAGGCCCCTGGAGCGCGGCTGGCGGTCGGCGGAGGCGGTGACGACTCTCACGCCTCAGCTGGAAGCGCGTGACTCCGGCGCCATCCCCCTGTGGGTCATCATACTGGCAGCTGTCGTCGGCGCGCTGCTGTTGTTACTGCTCATCTTCGCGCTATACAAATGCGGATTCTTCAAACGCAACCGTCCCTCCGACCACGTCGAGCGTCAACCTCTGAACGGTCGCGACGAACATCTCTGA

Protein sequence:

>DPOGS209596-PA
MALFLCVSLCAWAGALGFNVDIPSRVVYTGNHQSMFGFTVQSHVDGDRKMILVGAPEDEPYRVENVQRPGAVYRCQPSHRGYTSENANNHLQMCSLIEFDKNRYNNMDTARRQIDQKSNQWFGATLTSTGRNGPIVACAPRYVSFVSAKLNQRDPVGSCFVAQTPDAVEVNEFSPCRNSSHDQRKTGMCQAGFSAAVSKDGQRLFLGAPGSFYWQGRTFSIDTSARFNYFMPKEGQLKVQTLNSKPVLISTPEDSPKHDDSYMGYSVTVGDFAGQGIQGVAVGVPRGSELRGLVVLYTWELQIIKNISGSQIGAYFGYSMASGDIDGDGADDIIVGAPMFTKTKSDGYEHGRIYVIYQGSDRSFQKNHARTGEVSKGRFGLAVTSLGDINYDGFGDIAVGAPYGGESGRGVVYIYHGSELGIGEKYSQAITAEEISPVLTTFGFSLSGGVDLDNNNYTDLAVGAYKSDSVVFLKSRPVVKVMAEVRFMGESKLISLDDRRCVLSNGTNVACARLMFCLTYDGVNVDQQINFEVTLDLDSRQKTSKRLFLMESRQTVYTTHILLNQGQQECKEITVYLDEEIRDKLTPLEVKMTYELVQPGGSAGASGAVPPVLDRTLGVERTDSLNIQKNCGPDNVCVPDLKMFVTTPTVNYVLGSGENVHIDVKVDNGGEDAFETAYYLQIPAGVTYAKMERLDKDTAETSPIYCSIKNKGTSGNSTLKCDLGNPMASGQSVNFRVVLEVDTRVTGLNFDMESNSTNPEVGTQYDNTRHLAIGVVIKAVLSVIGTSDPPELHYNASLYESRDLKDDTKLGPQVIHKYNIKNEGENLMYMVVQPQWAGDIVCDVARNVNPDNLFIQNPYAYMLTKEKEEMMSSEFYSGAQIGGAGGYYGQQYWEEKHTISGEQAGQTGAGQVNRGQTASNAFGTQTSSGGQYAATSQGNGQFNGEHSIHISGGQATGGQSTYVNRGQASSGQYGTHISGGRVAGEQYGSTQVSGTRYGTHVSGQTGEISGDQMGTHVIENSGSVSNTYGTQLNKNNDQISGGQYYGGQMGGGGQGSGSSYSYNRTWSSGSNEGLSAEEQEAIRKRLQAMAQPNYQGSGREYNEGGQNTYSQSSSHGSAYGQNGYSQGGQYYGREQNHGGQSFNIQGGQGVGSAEYGQDGRIVRIRNKTVVFDSNHNVISESETSTEYGSLGHEGEAGSSFNTYLNNQGVNQNYGSSQGFVHGSWGTTETVNPNLMNAGSKTYRNPSVTVVGDEDEDKLEGFGAYAAQNSNNEFKFGIADVTNQAGSSGGGAAYRSGSQVYQNGQQGAFQGGVQGGASSSYGNSGSYSYSSSQSGGHRSYSQSGHSFSSSSEQRRESTRRKRQDEVDTELKEMLSRCEEKYKCSVVRCHTGRLVKGQAVWVALRSRINSSVLNEISQERSVKLSSLGAARVSRLPLVGRPLERGWRSAEAVTTLTPQLEARDSGAIPLWVIILAAVVGALLLLLLIFALYKCGFFKRNRPSDHVERQPLNGRDEHL-