Monarch geneset OGS2.0

DPOGS212181
TranscriptDPOGS212181-TA2481 bp
ProteinDPOGS212181-PA826 aa
Genomic positionDPSCF300038 + 1188977-1202966
RNAseq coverage46x (Rank: top 71%)
Annotation
HeliconiusHMEL0175835e-1530.45% 
BombyxBGIBMGA006624-TA9e-5927.47% 
Drosophilascb-PA9e-1525.82% 
EBI UniRef50UniRef50_Q1G0S76e-3624.75%Hemocyte-specific integrin alpha subunit 1 n=1 Tax=Manduca sexta RepID=Q1G0S7_MANSE
NCBI RefSeqXP_313576.46e-2122.57%AGAP004303-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|989625022e-3524.75%hemocyte-specific integrin alpha subunit 1 [Manduca sexta]
NCBI nr blastxgi|989625025e-3824.06%hemocyte-specific integrin alpha subunit 1 [Manduca sexta]
Group
KEGG pathwayssc:3970592e-21 
 K06476 (ITGA2B)maps-> Regulation of actin cytoskeleton
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Hematopoietic cell lineage
    ECM-receptor interaction
    Dilated cardiomyopathy
    Pathways in cancer
    Small cell lung cancer
    Focal adhesion
    Hypertrophic cardiomyopathy (HCM)
InterPro domain[360-413] IPR0135191.1e-06Integrin alpha beta-propellor
[300-339] IPR0135172.6e-06FG-GAP
Orthology groupMCL30449 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212181-TA
ATGAAAGGCATATATTTAATTATTTTCAATCTGCTATGTAATGAGGTTTCCTCCGGTGGGTATTATTTTCATGAACCTTCGTATATTGAATTGACTTCATCCGGAAAGGACTTTGATTTCGGATTTTCACTTGATTACCAAATTAAAACTAATAGTTTAGTGGTTGGTGCACCCAAGAGTGATTTGGACGGAAAAGTGTTTACATGTCCTCTGTCAAATATAACAAACAGCCGGAACGGAGGTGTAAAATGTCAGGAAGCATCTATTAATATGGACAGAGTTGCATCAGATTTCAGAAAACCCCCAAGTGGAGGGAATGAACACCAATGCCACCTTGGATCATCTTTGGCTGTTACTGACGCTTATTTTTTCACTTGTGCTCCTTTGTGTTCAAAATATCTTCGATTATCAAACGGTTCTTTAGTGTTCGGAACTTTCGGCACATGTTTTGTGTATGATGGTGAAAATGCCACGAGATACTGTGGTCTTCTCGAAAAGTACGAACATAGAATAAAACCTATAGCAGCTGTCGATAAATTTTATGGAGGTGTAGGATGGACAACATTTTCGGATTACAACAATAAGATTATCCTAATAGCGAAGTCTATTCTAAAAGTAAGTAGTATTACTTACATAGAAATGGACTCTCCTCTTAATCCAGTCGAAGAGGTGCCAATGCATAGCTCGTTAAATATATTTTATTACAAAGGTCGAGCTTTTGCGAGTGGAAAATTTTTCAATGATAAAAGAAATCTATATGCGTTCAGTATGAAAAAAGAAAAGCAAACAGGAGCGATTGCTTTCCTATACTACGACAAGAATCTTCCAAAGAAATTACAAGTGTTAAAAGATAACAGAAATCCTATTTTAATCTTAGATAACAAGTCTGTGTCAATGTTTGGTACTTCCTTGCATAGTGTGGACATAAATGGAGATGATTTCTCAGAGCTTATTGTTGGGGCTCCAGGCTTTTCTCAGATAGATGGTGGCTATGAGAACGGTGCTATATATTTATATGAAGGTGGAGGGCCAATTATGGAGAGAAGAGAACCGACACATCGTATTAAAGGAACCAAAGATGGTGCTCGCTTCGGTTCATCCATAGCTTCCACTGATCTGGACGAAGATGGTTTACCAGAAATTTTCGTGAGTGCACCTTACGAAAATGGAGGCGCTGGTGCTGTATACATTCTTTTAGGATACGAGGTCAAGAAATTACTTCGGAGTACCGTTAAGGAAACCTCATTATCTAGCTTCATGTATTCACAAACATTACAAATATCCGAATTTAAATCCTTCGGCTACAGTCTTCATGCATTTAACAGAAACGGTGTAAATTTTTTAAGTGTGGGTGCGCCAGCCAGTGGTCAGATTGCAATATATCGTAGCATATCCTTTATCAACGCTACTTTATCAATGTACCTGGCAGGAGATAAGGCAGTTCGGGAACAAGATGAAAATTTTATAGCCGTCATAAAAATTCATCTGGGCTATCCCGAAATTATATTAGATACAGATATAAAATTATTCCTGTGTACGGAGCTGTCTGGAGACGAAGCTAAGATTCAGAATAAAACAATTATCGAGTTCAATTTATCAAAGGAAAAACCCACCGAACTCAAATTTAATATTGTCGTATTGTTGGCTAGGAGAGGCCCGGGAGATTATAAGATCACTGCCAAAGTGAAAACGGATTTGGAAATGTCTAAAGAACGTAATATTATACCAAAAGAATTCAATAAGTCATTAGTGATGATAACTCCGCAAAGTAATAGAGAGGCTGTTTTGGAAGTGTCGCGACACTGTAAGGGCGATGAATGTGTTCCCCAACTGTCTACTTATTTGGAATGGTCTGGCAGGTCACCTGACACATACCTGTTAGGTTCGAGTGTGAAAGAGACCATGAGGATAAGTATTTTGAACGAAGGCAGCAATACGTACGATTCTTGTGCATGGATAAAAGTTACTGGAGCTCAGGTTGCAGTATTAGCATGTGTCCAAATCGATGACACTTGGTATAAGTGTGATTTACGAATTGAAAGGGATGCAACGAGTACTATTGACATACAGTTTGATTTGAGCCACCCCACTAACATGGAAGAAAGCCTTAACATTAAAGTCCTTTTATTCAACCACTGTGCGTCCTTAAGTTCGAACGCAACTTCAGAAGAAAACAAGAAAATTACTTACAAATTAACTTCTGAAAACCTATACGTTGAAGGGTCGTCCATGGATCGAAACTTCACAGAGAGTGAACTTAAAGATTTGGGAACAAATGATACTATAAGCGTTCACGAAACGTATAAGATAACGAATAATGCTTCGATATCCTGGAAGTCTGTGAAAGTTGTACTGTCATTACCGAATTTGAATTTCATGCAGGATTACTTAATGGGCTTCTTTGTAAGAAAAGAGAAAAAGAAGTTAAACAATCTCAGAGACTCAATAAGGAAAGGAACGTTTGTAAGTAGATTTGTGTAG

Protein sequence:

>DPOGS212181-PA
MKGIYLIIFNLLCNEVSSGGYYFHEPSYIELTSSGKDFDFGFSLDYQIKTNSLVVGAPKSDLDGKVFTCPLSNITNSRNGGVKCQEASINMDRVASDFRKPPSGGNEHQCHLGSSLAVTDAYFFTCAPLCSKYLRLSNGSLVFGTFGTCFVYDGENATRYCGLLEKYEHRIKPIAAVDKFYGGVGWTTFSDYNNKIILIAKSILKVSSITYIEMDSPLNPVEEVPMHSSLNIFYYKGRAFASGKFFNDKRNLYAFSMKKEKQTGAIAFLYYDKNLPKKLQVLKDNRNPILILDNKSVSMFGTSLHSVDINGDDFSELIVGAPGFSQIDGGYENGAIYLYEGGGPIMERREPTHRIKGTKDGARFGSSIASTDLDEDGLPEIFVSAPYENGGAGAVYILLGYEVKKLLRSTVKETSLSSFMYSQTLQISEFKSFGYSLHAFNRNGVNFLSVGAPASGQIAIYRSISFINATLSMYLAGDKAVREQDENFIAVIKIHLGYPEIILDTDIKLFLCTELSGDEAKIQNKTIIEFNLSKEKPTELKFNIVVLLARRGPGDYKITAKVKTDLEMSKERNIIPKEFNKSLVMITPQSNREAVLEVSRHCKGDECVPQLSTYLEWSGRSPDTYLLGSSVKETMRISILNEGSNTYDSCAWIKVTGAQVAVLACVQIDDTWYKCDLRIERDATSTIDIQFDLSHPTNMEESLNIKVLLFNHCASLSSNATSEENKKITYKLTSENLYVEGSSMDRNFTESELKDLGTNDTISVHETYKITNNASISWKSVKVVLSLPNLNFMQDYLMGFFVRKEKKKLNNLRDSIRKGTFVSRFV-