Monarch geneset OGS2.0

DPOGS212084
Transcript: DPOGS212084-TA (2793 bp)
Protein: DPOGS212084-PA (930 aa)
Genomic position: DPSCF300038 - 1181075-1188434
RNAseq coverage: 46x (Rank: top 71%)
Annotation
  Database          Hit                 E-value   Identity  Description
  Heliconius        HMEL012571          2e-34     49.32%
  Bombyx            BGIBMGA006716-TA    8e-65     29.88%
  Drosophila        alphaPS5-PA         5e-26     25.57%
  EBI UniRef50      UniRef50_Q1G0S7     3e-115    32.50%    Hemocyte-specific integrin alpha subunit 1 n=1 Tax=Manduca sexta RepID=Q1G0S7_MANSE
  NCBI RefSeq       XP_002040109.1      1e-29     26.81%    GM16026 [Drosophila sechellia]
  NCBI nr blastp    gi|98962502         1e-114    32.50%    hemocyte-specific integrin alpha subunit 1 [Manduca sexta]
  NCBI nr blastx    gi|98962502         2e-119    32.39%    hemocyte-specific integrin alpha subunit 1 [Manduca sexta]
Group
  Gene Ontology     GO:0008305     6.9e-24   integrin complex
                    GO:0007155     6.9e-24   cell adhesion
  KEGG pathway      mdo:100027090  5e-25
                    K06584 (ITGA8) maps to:
                      Dilated cardiomyopathy
                      Regulation of actin cytoskeleton
                      Cell adhesion molecules (CAMs)
                      Arrhythmogenic right ventricular cardiomyopathy (ARVC)
                      Hypertrophic cardiomyopathy (HCM)
                      Focal adhesion
                      ECM-receptor interaction
  InterPro domain   [224-236]  IPR000413  6.9e-24   Integrin alpha chain
                    [334-385]  IPR013519  8.6e-10   Integrin alpha beta-propellor
  Orthology group   MCL18521   Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212084-TA
ATGATTTTCGGTATATTTAATTTGGTCGTTCTAAATATAATAACAGTATGTCAAGGCTACATCCATCTCGCCTCCATGAAGATTTACCGACACGATACAAACAAAACTAATTCTTTGTTTGGCTACAGCCTAGCTTACCAATCAGGGATTCGAAGATTGATTATCTCGTCACCACTTGAAAACGAGAATGGACAAGTTTTTACAATGTCCTTGAATTCTTCAAAAATTGAAACTGTTTCCCTGGACCCAAAACTTCTACCACGAACGCCAACTATCAAACATAACTTCTGGCTCGGTGCTACAGTGAAGGCTAATTCAAATTTCTTTGTTACATGCGCCCCACGATACGCAGAACTGAAGACAATCCGAAAGCCAATTCGTACACAATACCCAGCCACACTAGGGCTCTGTGTCTTATTTGAATATACGGATTCGGATACATATATATCTAGAAGACTCGCACACATGAGTAGTGAAGACCGATCCATAAGAGCAAAAACATTTGGAGAAAATCTAGACTCCATGGGTTGGAGTATAGACCTAGCAAACTCTGGTCAAGTCCTAATAGGATCGCCGGCGATGTTGACAGGTCGTGTGGTGTTATATGAAGATCCTTACAGCAAAGACTTCCCTAAACTCATATATAAAATGAACCAAGAAAACATAGCTCGACATAATTACGGTTACAGCTTAGCTATAGGAGAGTTTTTTGATCAGAGCACAGTCTACGCAGTCAGTGCTACGTTTGGATTTGGAAAGGTTTATTTTTTTGACTCGAAGCTTCAAAATATAGGTATTATAAAAGATATAGAAATTGGTTCGATTTTCGGCGCAGCTCTATGTCCAGCCCATCTAGGAACGAAAGCCTTGTTGGTAGGAGCTCCAGCGTATTTCGACAAAACTTACAATTACGATGTGGGAGCCGTCTACGTTTATTTGGATCAAACAGAAGAAGGTTGGAAGAAAATGCTCCTAAAACGAAAAATCAAAGGTTTATCTAGTGGAAGCTACTTTGGTCACGCTATTGCTAGTCTTGGTGATATAGATGGGGATAATAAAGATGAGATTGTCGTAGCAGCACCCTTTGAAGACGGAATCGGTGCAGTGTATTTCTTCTCTGGTTCTGGTGTTCTGGATGGATTATCTCAACCGAGAAAAATTCAACCCAAGGGTTTCCAATCCTTTGGATTCAGCCTAACAATATTGGAAGACTTGGACGGGAACGGTTGTAAAGAACTAGCTGTTGGATCTCCCAAAGATAACAAAGTGGTTATTTTTAAAACTATCGCTTTTATTAAAGTAATACTGAAAGCAGACTTAAATCAAGAGAGCGACAAAGAATTTGTATTAACGTCATGTATCGATGGAGTCTATCCTCTGAAGCCAGAAAATATAACTGCAGATATAGTAATTGAAGTGAAATTAAAAAACGCAGCGTTCACAACGGTCACCTCAAAAAATGGTGTGTATCAATACGAGGTTTCCATGGCCGAAAAGAGGAGGCTGTGTAAAAACTTCACACTCGAAGTGCATGAGGATGCTGAAAAAGGTTTCGATTACGAAATTATGTCATACGAAGTAACTGCATCTTTAAAAGAACATCCAGAAGAAGCGCTTGAGTTTGACCCATCAAGAGTCCTCTTGAGTGATGAAAGTGTTTTAAAACAACAAGGACAGAAGTGGAAAGACGATACATCGTTACAAGAACCGCAACTTCACCTCAACATATCTACTTCAATGGCACAACCGTACATGATTGGTTCTTCGTACCAAGAAACGTTTATATTGTCAGTATTAAACGAGGGTGGTGCAGCACAGGCGGCCTGTATGCATCTGGTAGTGGAAGGAGCGCGAGTCATCGCACATCCATCTTCGTGTAGACGCGAGCTGGACAGTTTAGTCTGCAGACACAACCATGTCATCAACACCAATGACTTTTGGATAAATGAAATTCTAATAGAAACAGATTATTTGACAAGTATTCATGACACTCTTACGGT
CCGTTGTGATTTATATCCACAATGTGGCGGTAAAAATGTATCAAGTTACGAAAAAATTATTGAATTAAAACCTTACAACTATATGGTCGTTATTACGGGCCAATCCAATCCTGATGAAAAGTTACCAATCACTGTTGATGATCTCCATACGGGCAAATCATTTGATCATGTGTATACGATATACAACTTCGGTCTCACCAACTGGGTAGGAGTGCAAAGTGAGATCGTTTTACAAAATTCCCAATTCATCGACTACTCAGATGCACCAATAAAGGTTTATGCATACACATCTTTAATCGAATGTGAAATTGGCAATAAAAACCGATCTGAAGAAAAAATAAGAGTGTTATGCAAAATAGGTGACTTGGGAAGACAAGAAAAAGTTGTTGTCGTTGTTGCCATGAATATTGCAAGAGATACTTTAAAATTCGACAGTGAAGATCAGAATATCACAGTCACCTCGTCTATGGAATTGCTGCTTCGGGACGGAAACAAATTTTTAATCTTGAATACAACACTGACTTTCGAAGCGGCCACAGTTCCTCTTTGGGTGACTCTCGCATCCTCCTTCCTAGGCGTCTTTCTTCTCATTATAATTGCATATATTTTATATGAATATGGATTTCTTCAACGAAAGACAAAGGAAAAACTAAATGAAACTAAAAAGGAGGTGTACAGACAGAGCGTGCGCCGTTCTATGATGCGAGAGAGCATGAGAGCGGCTATCAACAGAAGGAGTGCAGAGGACAACGATATATTGATGGAGGAAGTCGTCCATGAATCAGATTTTTAA

Protein sequence:

>DPOGS212084-PA
MIFGIFNLVVLNIITVCQGYIHLASMKIYRHDTNKTNSLFGYSLAYQSGIRRLIISSPLENENGQVFTMSLNSSKIETVSLDPKLLPRTPTIKHNFWLGATVKANSNFFVTCAPRYAELKTIRKPIRTQYPATLGLCVLFEYTDSDTYISRRLAHMSSEDRSIRAKTFGENLDSMGWSIDLANSGQVLIGSPAMLTGRVVLYEDPYSKDFPKLIYKMNQENIARHNYGYSLAIGEFFDQSTVYAVSATFGFGKVYFFDSKLQNIGIIKDIEIGSIFGAALCPAHLGTKALLVGAPAYFDKTYNYDVGAVYVYLDQTEEGWKKMLLKRKIKGLSSGSYFGHAIASLGDIDGDNKDEIVVAAPFEDGIGAVYFFSGSGVLDGLSQPRKIQPKGFQSFGFSLTILEDLDGNGCKELAVGSPKDNKVVIFKTIAFIKVILKADLNQESDKEFVLTSCIDGVYPLKPENITADIVIEVKLKNAAFTTVTSKNGVYQYEVSMAEKRRLCKNFTLEVHEDAEKGFDYEIMSYEVTASLKEHPEEALEFDPSRVLLSDESVLKQQGQKWKDDTSLQEPQLHLNISTSMAQPYMIGSSYQETFILSVLNEGGAAQAACMHLVVEGARVIAHPSSCRRELDSLVCRHNHVINTNDFWINEILIETDYLTSIHDTLTVRCDLYPQCGGKNVSSYEKIIELKPYNYMVVITGQSNPDEKLPITVDDLHTGKSFDHVYTIYNFGLTNWVGVQSEIVLQNSQFIDYSDAPIKVYAYTSLIECEIGNKNRSEEKIRVLCKIGDLGRQEKVVVVVAMNIARDTLKFDSEDQNITVTSSMELLLRDGNKFLILNTTLTFEAATVPLWVTLASSFLGVFLLIIIAYILYEYGFLQRKTKEKLNETKKEVYRQSVRRSMMRESMRAAINRRSAEDNDILMEEVVHESDF-
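
The transcript and protein lengths in this record can be cross-checked with a little arithmetic: a 930 aa protein plus one stop codon should correspond to 3 x (930 + 1) = 2793 bp of coding sequence. The sketch below illustrates that check and translates the first and last few codons of DPOGS212084-TA. It assumes the standard genetic code and that the trailing "-" in the protein sequence marks the stop codon; the names `translate` and `expected_cds_length` are hypothetical helpers introduced here, not part of any MonarchBase tooling.

```python
# Minimal sketch: cross-check a coding transcript against its protein.
# Assumptions: standard genetic code; "-" is used as the stop symbol,
# matching the trailing "-" in the protein sequence of this record.

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order ("*" = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_length: int) -> int:
    """Coding length in bp for a protein of the given length plus one stop codon."""
    return 3 * (protein_length + 1)


if __name__ == "__main__":
    # First 18 bp and last 15 bp of DPOGS212084-TA, copied from the record.
    print(translate("ATGATTTTCGGTATATTT"))
    print(translate("GAATCAGATTTTTAA"))
    print(expected_cds_length(930))
```

Run as a script, this prints `MIFGIF`, `ESDF-`, and `2793`, which agree with the start and end of DPOGS212084-PA and with the transcript length listed above.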