Monarch geneset OGS2.0

DPOGS202807
Transcript: DPOGS202807-TA, 2733 bp
Protein: DPOGS202807-PA, 910 aa
Genomic position: DPSCF300018 - 251277-258716
RNAseq coverage: 169x (Rank: top 51%)
Annotation
Heliconius: HMEL003759, 0.0, 47.87%
Bombyx: BGIBMGA010454-TA, 2e-164, 55.40%
Drosophila: scb-PA, 7e-55, 26.21%
EBI UniRef50: UniRef50_Q1G0S5, 0.0, 52.45%, Hemocyte-specific integrin alpha subunit 3 n=2 Tax=Obtectomera RepID=Q1G0S5_MANSE
NCBI RefSeq: XP_001663561.1, 8e-66, 29.26%, integrin alpha-ps [Aedes aegypti]
NCBI nr blastp: gi|98962506, 0.0, 52.45%, hemocyte-specific integrin alpha subunit 3 [Manduca sexta]
NCBI nr blastx: gi|98962506, 0.0, 52.40%, hemocyte-specific integrin alpha subunit 3 [Manduca sexta]
Group
Gene Ontology: GO:0008305, 5e-50, integrin complex
Gene Ontology: GO:0007155, 5e-50, cell adhesion
KEGG pathway: mdo:100027449, 2e-58
 K06483 (ITGA4) maps-> Leishmaniasis
    Regulation of actin cytoskeleton
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Hematopoietic cell lineage
    ECM-receptor interaction
    Dilated cardiomyopathy
    Leukocyte transendothelial migration
    Cell adhesion molecules (CAMs)
    Focal adhesion
    Hypertrophic cardiomyopathy (HCM)
    Intestinal immune network for IgA production
InterPro domain: [208-220] IPR000413, 5e-50, Integrin alpha chain
InterPro domain: [448-758] IPR013649, 3e-22, Integrin alpha-2
InterPro domain: [334-389] IPR013519, 6.5e-16, Integrin alpha beta-propellor
InterPro domain: [341-373] IPR013517, 4.7e-07, FG-GAP
Orthology group: MCL11817, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202807-TA
ATGGATTCAAATGTATTAATTGTGGGGGCGCCCAAAGCACGAAGCAAGCTGGCCAGAATGATGGCCACAGGACAGGTTTACAATTGCAAAATACTTGGTTTTGATGTTCACAACGTAACATGTTACCCTCTCGGAAGTAATGGCACAGCTCAAGACGCTATATTCGGACGGTTCGCGGGTTATTCTGATTTCTTCAGGGATGATATGTGGTTCGGAGCTGTAATCGCTTTGGTTCCCAACGGAAAGTTATTGATTTGCTCACCGAGATGGACAAATCCTTACAAGGATACACATTTACTCGCGAACGGTGCTTGTTACATCCAGGCCCAAAGGAGAGCTTTAAGTCTCCTTCCACTAAAAGACATGACCCGACAAGCGTTTATGACACAGGGTTTAAGGAAGGAATACGGGGAATACGGCACCCATCTCAATTTTTATGCCTACGGTCAAGCAGGCTTCTCAGCAAAAGTAACAGAAAACAATAGCGTTATAATCGGAGCGCCAGGCCTCTTGCAATGGACCGGTGGAATCGTGGAATACAAATATTACCCAGATCCAAGAAGTGTCCTTTTTGGTTTGCAACCTATTACGAATCCATATTACACTCCAGATTTAGGACCGGATGATTACTTAGGATACAGTGTCGAGTCTGGCATATTTGAAAAAAACGGAAGGACATTGTACGTAGCTGGAGCTCCGAGATCCAAAGCTGGTTATGGCCAAGTGTTAATAATTGAGCCATCGTTTAGAGAAAACGGACCTCTGAACATTAAAGCCAAGTTGATAGGTCATCAGCTGGGATCCTACTTCGGAGCCAGTATGTCATGCACTGATATCAATGGTGATGGTATATCGGATTTGATGGTGGGAGCACCAAATTTCGTCATTCACGATGGCAGTCTTCATTACGACCAAGGAGCGGTGTTTGTCTATTTGACAGAAAGTCAGGAATCAAATTTCACTCTGATTGAACATGCTTATGTATTTGGATCAGCACGGAGTGGATCGCGGTTTGGAAGTTCGATCGCTAATTTAGGAGATATAGACGGGGATGGTTACAATGACATAGCTATTGGTGCTCCATGGGAGAATGACGGTATCGGAGCTGTATACATTTATCGAGGTGGCGCTGATGGGTTAGTCCAGCCATTTGTACAAAAAATTTTTGTTGAAGAAGCGAGAAGTTTTGGTGTTTCGATTTCCAAGGGTGTAGATTTAACTAACGATAATTGTAATGAGCTAGCTGTAGGCGCTCTCAATTCCCGCACAGCATATATTTTCAAATGCATACCAACAATGCATGTGGACGTTTCTATTAAAGTCCCGGATGCAATGAACTTGCAACAAAACGCTACCAACTTCACTGCTTTATTCTGTGTTAATGCGCGCTCCAGTAAATTGTGGCCTCATGTGAAAATAGACTTTATAGGCAGAATAGTTATTGATCCCGAGGAAAACAGGGCAAAGCTAAAAGATGACACTGAATATGACATCACAATTGCACCAGGAGATGAGAATTGTGATGAACAAATTGTAGAAGTGATGACAACGGCAGACTTGTCTAAACCAATTTCGATGAAATTCAATTTGGAGGTCAATGAATATCCGATAGAAAACTCGGATTTGCAGCACGCCGCTAGACTTGAGGAAAACTCTATTCTAGAAACTACATTAGATATCCAACTGACGAGAGACTGTGGAGAAGATCTTATCTGTAAGCCCTTGCTAGAAATGACATTGGAACCTTTAAATAGTCCATACGTTCCAGGTTCAGAACACAGGCTTGGACTAAAAGTGACAGTTTTGAATAAAGAGGAGCCATCGTATGGGGCCAAAGTTCATCTCATTGTTCCTTCTTCACCGAAGCGTCTCCCAACTGAATGCTCTTTACAAAACTTAAATGTAACTTGTAGTCTTCCGGCTCCATTGATGAGAATGAATTCAGTTGTATTTGAAATAGAATTGGAATATATACCTATAGACAGAGCTGAAGATCTACT
AATAATAAAGGCTAGACTAGAAGATCCGCTATATGAAGATTCTGATATAGAAAGGGCATTCCAGGAGTTAGACATTGTTATTACACCTAAAGCAAACTTTGCTATTAGCGGAAAATCGTTACCCAACGCGACTATACTAGTGACAAGAGACAAACTTCATGGGGACGAAAATATAACATTTGTTCATCAGTATGAGATCATGAATTGGGGACCATCTGACTGGTATCGTTTGAGAGTACAAATAATTTTATCCGAAAAGGTCAACATGTCGACTCGACTTAAAGAATGTTTGGAACTAGACCGAGTAACTCATTGCGAATGGAAGCTTCCAGCAAAAGTTTCTTTGCCAGTGGTTCTACCTCTGCGCTTTGATTTACATGATCACGGTGAATTCTTAGAAAAAAAAGTTGTGTACAACATCACCTCTACAATGACCATTCTATTGGAAGATCAGAATAAATCTGTTTCGACGATCACAACTTTGATTTTGGAGCCTGAGCAACCCTATTGGCCAGTTATTGTTGGTTGCATAGCAGGCCTCCTTTTGCTATCCGCTATTATTACAGGATTTTATAAATGTGGATTCTTCTCGAGAAAAAGAATTGAAGATTTCCAAAGACTTCAGGAACACCAGGCGGATGGAGCTTCTCCATCAGATGCGAATATTTCAGTTGGCTCACTTGGTGAGAACGATAAATCGACACAAGAATTAATCACTGATGACTCAGATTGA

Protein sequence:

>DPOGS202807-PA
MDSNVLIVGAPKARSKLARMMATGQVYNCKILGFDVHNVTCYPLGSNGTAQDAIFGRFAGYSDFFRDDMWFGAVIALVPNGKLLICSPRWTNPYKDTHLLANGACYIQAQRRALSLLPLKDMTRQAFMTQGLRKEYGEYGTHLNFYAYGQAGFSAKVTENNSVIIGAPGLLQWTGGIVEYKYYPDPRSVLFGLQPITNPYYTPDLGPDDYLGYSVESGIFEKNGRTLYVAGAPRSKAGYGQVLIIEPSFRENGPLNIKAKLIGHQLGSYFGASMSCTDINGDGISDLMVGAPNFVIHDGSLHYDQGAVFVYLTESQESNFTLIEHAYVFGSARSGSRFGSSIANLGDIDGDGYNDIAIGAPWENDGIGAVYIYRGGADGLVQPFVQKIFVEEARSFGVSISKGVDLTNDNCNELAVGALNSRTAYIFKCIPTMHVDVSIKVPDAMNLQQNATNFTALFCVNARSSKLWPHVKIDFIGRIVIDPEENRAKLKDDTEYDITIAPGDENCDEQIVEVMTTADLSKPISMKFNLEVNEYPIENSDLQHAARLEENSILETTLDIQLTRDCGEDLICKPLLEMTLEPLNSPYVPGSEHRLGLKVTVLNKEEPSYGAKVHLIVPSSPKRLPTECSLQNLNVTCSLPAPLMRMNSVVFEIELEYIPIDRAEDLLIIKARLEDPLYEDSDIERAFQELDIVITPKANFAISGKSLPNATILVTRDKLHGDENITFVHQYEIMNWGPSDWYRLRVQIILSEKVNMSTRLKECLELDRVTHCEWKLPAKVSLPVVLPLRFDLHDHGEFLEKKVVYNITSTMTILLEDQNKSVSTITTLILEPEQPYWPVIVGCIAGLLLLSAIITGFYKCGFFSRKRIEDFQRLQEHQADGASPSDANISVGSLGENDKSTQELITDDSD-
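
The lengths reported in this record agree with each other: a 2733 bp transcript holds 911 codons, which is 910 amino acids plus one stop codon (shown as the trailing "-" in the protein record). The following is a minimal Python sketch of that check, assuming the FASTA records are available as text. The function names read_fasta and cds_matches_protein are hypothetical helpers written for illustration, not part of any OGS2.0 tooling, and the records in the example are toy data rather than the real sequences.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def cds_matches_protein(cds, protein):
    """True if the CDS length equals 3 * (residues + 1 stop codon)."""
    # Assumption: a trailing '-' in the protein marks the stop codon.
    residues = protein.rstrip("-")
    return len(cds) % 3 == 0 and len(cds) == 3 * (len(residues) + 1)


# Toy records (NOT the real sequences) to illustrate the check.
toy = """>toy-TA
ATGGATTCA
AATTGA
>toy-PA
MDSN-
"""
recs = read_fasta(toy)
print(cds_matches_protein(recs["toy-TA"], recs["toy-PA"]))

# Expected transcript length for a 910 aa protein plus a stop codon.
print(3 * (910 + 1))
```

Because read_fasta joins wrapped lines, a sequence broken across several lines, as the nucleotide record above is, gets counted as a single record.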