Monarch geneset OGS2.0

DPOGS200409
Transcript        DPOGS200409-TA  2403 bp
Protein           DPOGS200409-PA  800 aa
Genomic position  DPSCF300236 - 511974-522264
RNAseq coverage   177x (Rank: top 50%)
Annotation
Heliconius        HMEL011595        2e-136  56.76%
Bombyx            BGIBMGA008986-TA  0.0     70.44%
Drosophila        betaInt-nu-PA     1e-134  35.74%
EBI UniRef50      UniRef50_E2BNA6   6e-155  38.57%  Integrin beta n=5 Tax=Formicidae RepID=E2BNA6_HARSA
NCBI RefSeq       XP_001654414.1    6e-163  39.05%  beta nu integrin subunit [Aedes aegypti]
NCBI nr blastp    gi|157125658      1e-161  39.05%  beta nu integrin subunit [Aedes aegypti]
NCBI nr blastx    gi|157125658      2e-168  39.77%  beta nu integrin subunit [Aedes aegypti]
Group
Gene Ontology     GO:0004872  3.8e-251  receptor activity
                  GO:0005488  3.8e-251  binding
                  GO:0007160  3.8e-251  cell-matrix adhesion
                  GO:0008305  3.8e-251  integrin complex
                  GO:0007155  3.8e-251  cell adhesion
                  GO:0007229  3.8e-251  integrin-mediated signaling pathway
                  GO:0016020  1.1e-11   membrane
                  GO:0007275  1.1e-11   multicellular organismal development
KEGG pathway      dpo:Dpse_GA14581  3e-132
                  K05719 (ITGB1) maps->
                      Axon guidance
                      Leishmaniasis
                      Pathogenic Escherichia coli infection
                      Regulation of actin cytoskeleton
                      Pathways in cancer
                      Shigellosis
                      Leukocyte transendothelial migration
                      Hypertrophic cardiomyopathy (HCM)
                      Phagosome
                      Focal adhesion
                      Bacterial invasion of epithelial cells
                      Arrhythmogenic right ventricular cardiomyopathy (ARVC)
                      ECM-receptor interaction
                      Small cell lung cancer
                      Dilated cardiomyopathy
                      Cell adhesion molecules (CAMs)
InterPro domain   [24-790]   IPR015812  3.8e-251  Integrin beta subunit
                  [29-456]   IPR002369  6.3e-154  Integrin beta subunit, N-terminal
                  [18-74]    IPR016201  1.1e-11   Plexin-like fold
                  [748-788]  IPR014836  1.5e-06   Integrin beta subunit, cytoplasmic
Orthology group   MCL17157  Insect specific

Nucleotide sequence:

>DPOGS200409-TA
ATGGCCCGTCTGTGTTACACGAACAGTCAGACGCCCACTCTTGATGACCAGTTACTCAACAAGCTGGTTTGTGTGGATCATGATGAGTGTGGATCGTGTTTGGCAGCCGCGGCTCATTGTAGATGGTGTGCGGATCCATACTATAGGTCTTCGGCTCCAAGGTGCAATGATGATGAGAGCCTGGTTGCGACAGGTTGCAGCCAAAGCATGATCCAAAGACCAGAAAAACCTGTATGGGAGGTAACTGAAAACTATACGCTTCAAGACATATCACCAGACAGTCATGAAGCAGTTGTGCAAATACAGCCACAAAAAATAAGGCTATCGTTAAAGCCACGTGAAAGCAAGAAAATAAAATTCTCCTTTAGACCGGCTAGAAACTACCCTTTAGATTTATATTATTTAATGGACCTTACTTGGTCTATGAAGGACGATAAGGAGACTTTGGTATCTTTGAGAGACGATCTGCCGACCTTGATAAAAAACTTGACGGACAATTTTAGGATAGGTTTTGGTAGTTTCGCTGAGAAGCCTATAATGCCCTTCATAAGTGTCGACGCGCGGAGACGTGAAAATCCTTGTACAGTGGAGGAACAGGCTTGTGAGGCAACTTATAGTTATAAACATCACTTGTCATTGACAAATCAGGTGAACGAATTTATAAAAGAAGTGAACAGTAGTTCCGTAACAGCCAACCTGGATAACGCCGAGGCTCAACTAGACGCTCTGGTACAGGCGATAACATGCGATAAGCAGGTCGGCTGGTCGTCACACAGTCGGAAGATTATTATACTGCTCTCCGATGGCCTGCTTCACACTGCGGGTGACGGAAAATTGGGCGGGGCGTCAATCAAGAATGATGAAACTTGTCATTTAGATGAAAATGGATATTACTCGGAAGCCGCTACTTATGATTATCCGTCCATAGCGCAAGTGTACAGAATATTAGATAAATACAAGGTAAACGTGATTTTCGCTGTGACGGAAAACGTAAAGGCCCATTACGATAGCTTGCACCAACTGTTAATTGATTTCACTCGTGTGGCAAAGTTAGAGAGTGACAGCTCAAACATTCTGAAGCTGGTCAAAATGGGTTACGAAGATATCGTGAGCGTCGTTAACTTCGAAGACGACGCCGGTCCTGGACCGGTAAAGGTAAAATACTTCACCGATTGCGGAGTGAAAGGTTCGATGGTTGAGGCGAAGCGTTGCACGGGCGTGGAGTACGGCATGACACTTAACTTCGAGGCCCACGTCAGTTTGGAATCTTGTCCCGAATACAAAAAGTCAAATCAGACTATAAGAATATCAGAAACACAATTGGGTGAGGATACTTTGACGTTGGAGATCCAGCTTCAGTGTGGTTGTAACTGCAAGAGTGATCTGATAGAGAACATGGACCTCACATGCCCGTCTAATTCACATCTCGTGTGCGGCGTCTGTCAGTGTAATAAGGGATGGTCTGGTCCATTATGCGACTGTTCAATAGAAGATGAAGCGGCCTCAGCGGCGTTGTTGTCTCAGTGTAAGGAGCCCAACGCTACCCGAGCTATAGCTTGTTCAGGGGCCGGCGACTGTCTCTGTGGGAAATGTCAATGTGACAACGGTTTCAACGGCAAGTACTGTCAGTGCAAGTCGTGCGAAGTTAGTATAATAAACCGTTTGGAATGTGGTGGTGCTGAGCATGGTGTGTGTGCTTGTGGGAAATGTGCCTGCGTTGCCGGCTGGAGCGGGGAAGCCTGCGACTGTACACTCGACACGGACCTGTGTATAGCACCAGGCCGGGAGGATGTGTGTTCAGGAAACGGAGATTGCGTCTGTGGTCGTTGTCAATGCAACAGGTTAGAGGACGGGACACTATTTTCAGGCGCATTCTGTGAGACTTGCGAGACTTGTGAGAACCCGCTCTGTGCGAATGCGGAGGCCTGTGTTCTGTGCCACTTGGACAGCAATTGCACAGACGCATGTTCAATTGGCAATATGAATTACACTGTCAGTGA
GAGACTGAATGAAGTGACCAGGAATAGCGATGATGCATCTTGTATTCTGCGACGCGAGGAAGACGGTCTGGAATGCGAATACCAATATACATACAAAGCTGGCGTCCAATCTATGATAGCCATGGAAATAGTCATAAGGGACAAGCTGTGCTACCAACCCACCAGTGCTAAAATAATGACGAGCTCGCTGATCATAATGGGCTGTGTCATACTCGCCGGTTTAATAGTAATCATGGCGGTGAAAATTGCCCAGGTCTTGTCCGATAGACGAGCATACGCCAAATTTTTACAAGAGGCCCAAGAAAGTAGGAAGAATATGCAAGAATTGAATCCCATATATAAATCACCGATATCTGAATTCAAGTTACCAGAATCCTATCCGAGAGACCGGGCTGATTTATAG

Protein sequence:

>DPOGS200409-PA
MARLCYTNSQTPTLDDQLLNKLVCVDHDECGSCLAAAAHCRWCADPYYRSSAPRCNDDESLVATGCSQSMIQRPEKPVWEVTENYTLQDISPDSHEAVVQIQPQKIRLSLKPRESKKIKFSFRPARNYPLDLYYLMDLTWSMKDDKETLVSLRDDLPTLIKNLTDNFRIGFGSFAEKPIMPFISVDARRRENPCTVEEQACEATYSYKHHLSLTNQVNEFIKEVNSSSVTANLDNAEAQLDALVQAITCDKQVGWSSHSRKIIILLSDGLLHTAGDGKLGGASIKNDETCHLDENGYYSEAATYDYPSIAQVYRILDKYKVNVIFAVTENVKAHYDSLHQLLIDFTRVAKLESDSSNILKLVKMGYEDIVSVVNFEDDAGPGPVKVKYFTDCGVKGSMVEAKRCTGVEYGMTLNFEAHVSLESCPEYKKSNQTIRISETQLGEDTLTLEIQLQCGCNCKSDLIENMDLTCPSNSHLVCGVCQCNKGWSGPLCDCSIEDEAASAALLSQCKEPNATRAIACSGAGDCLCGKCQCDNGFNGKYCQCKSCEVSIINRLECGGAEHGVCACGKCACVAGWSGEACDCTLDTDLCIAPGREDVCSGNGDCVCGRCQCNRLEDGTLFSGAFCETCETCENPLCANAEACVLCHLDSNCTDACSIGNMNYTVSERLNEVTRNSDDASCILRREEDGLECEYQYTYKAGVQSMIAMEIVIRDKLCYQPTSAKIMTSSLIIMGCVILAGLIVIMAVKIAQVLSDRRAYAKFLQEAQESRKNMQELNPIYKSPISEFKLPESYPRDRADL-
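
Length check:

The transcript and protein lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper name expected_protein_length is hypothetical, and it assumes the 2403 bp transcript is a complete coding sequence with no UTRs, beginning at the ATG and ending at the TAG seen at the two ends of the nucleotide sequence above.

```python
# Lengths and terminal codons copied from the DPOGS200409 record above.
TRANSCRIPT_BP = 2403
PROTEIN_AA = 800
FIRST_CODON = "ATG"   # first three bases of DPOGS200409-TA
LAST_CODON = "TAG"    # last three bases of DPOGS200409-TA
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS of cds_bp bases ending in one stop codon.

    Hypothetical helper; assumes the transcript contains no UTRs.
    """
    if cds_bp % 3 != 0:
        raise ValueError(f"{cds_bp} bp is not a whole number of codons")
    return cds_bp // 3 - 1  # drop the terminal stop codon


codons = TRANSCRIPT_BP // 3
print(f"{TRANSCRIPT_BP} bp = {codons} codons")
print("starts with ATG:", FIRST_CODON == "ATG")
print("ends with a stop codon:", LAST_CODON in STOP_CODONS)
print("lengths agree:", expected_protein_length(TRANSCRIPT_BP) == PROTEIN_AA)
```

Under these assumptions, 2403 bp corresponds to 801 codons, that is 800 residues plus one stop codon, which matches the 800 aa reported for DPOGS200409-PA. The trailing "-" at the end of the protein sequence presumably marks that stop codon.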