Monarch geneset OGS2.0

DPOGS203951
Transcript: DPOGS203951-TA; 1869 bp
Protein: DPOGS203951-PA; 622 aa
Genomic position: DPSCF300005 + 159444-170306
RNAseq coverage: 40x (Rank: top 72%)
Annotation
Heliconius: HMEL012087; E-value 0.0; 69.54%
Bombyx: BGIBMGA000480-TA; E-value 1e-154; 60.26%
Drosophila: sca-PB; E-value 2e-94; 36.26%
EBI UniRef50: UniRef50_D6WEF6; E-value 4e-114; 46.90%; Scabrous n=1 Tax=Tribolium castaneum RepID=D6WEF6_TRICA
NCBI RefSeq: XP_972571.1; E-value 8e-115; 46.90%; PREDICTED: similar to scabrous protein [Tribolium castaneum]
NCBI nr blastp: gi|91078308; E-value 2e-113; 46.90%; PREDICTED: similar to scabrous protein [Tribolium castaneum]
NCBI nr blastx: gi|91078308; E-value 1e-111; 46.90%; PREDICTED: similar to scabrous protein [Tribolium castaneum]
Group
Gene Ontology: GO:0007165; E-value 3.9e-71; signal transduction
    GO:0005102; E-value 3.9e-71; receptor binding
KEGG pathway: ecb:100049835; E-value 4e-33
    K06252 (TN) maps to: Focal adhesion; ECM-receptor interaction
InterPro domain: [423-618] IPR002181; E-value 3.9e-71; Fibrinogen, alpha/beta/gamma chain, C-terminal globular
    [425-550] IPR014716; E-value 9.7e-38; Fibrinogen, alpha/beta/gamma chain, C-terminal globular, subdomain 1
    [551-594] IPR014715; E-value 6.3e-17; Fibrinogen, alpha/beta/gamma chain, C-terminal globular, subdomain 2
Orthology group: MCL16112; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species
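
The lengths and coordinates listed in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that the genomic and domain coordinates are 1-based and inclusive, which the record itself does not state, and the helper names `span_length` and `contains` are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1869
PROTEIN_AA = 622
GENOMIC_START, GENOMIC_END = 159444, 170306
DOMAINS = {
    "IPR002181": (423, 618),  # C-terminal globular domain
    "IPR014716": (425, 550),  # subdomain 1
    "IPR014715": (551, 594),  # subdomain 2
}


def span_length(start, end):
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def contains(outer, inner):
    """True if the interval `inner` lies entirely within `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


# A transcript made only of coding sequence has one codon per residue
# plus one stop codon that is not counted in the protein length.
codons = TRANSCRIPT_BP // 3
coding_matches = TRANSCRIPT_BP % 3 == 0 and codons == PROTEIN_AA + 1

genomic_span = span_length(GENOMIC_START, GENOMIC_END)
domain_lengths = {name: span_length(*pos) for name, pos in DOMAINS.items()}
subdomains_inside = all(
    contains(DOMAINS["IPR002181"], DOMAINS[name])
    for name in ("IPR014716", "IPR014715")
)

print(codons, coding_matches)   # 623 True
print(genomic_span)             # 10863
print(domain_lengths)
print(subdomains_inside)        # True
```

Under these assumptions, 1869 bp corresponds to 623 codons (622 residues plus a stop codon), the gene spans 10863 bp on the scaffold, and both subdomains fall inside the parent fibrinogen domain.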

Nucleotide sequence:

>DPOGS203951-TA
ATGGAGTTCGTTAAACTTTGGGCGCTAATTATTTGTTTGTGTTCGGTGAACGCTCGCGAAATTGACATTAAAAATGAATTGCTATCCCTGACCGAACAATTCAAGGCTCTGAAGACCGTGCATCTGGCTGATGTTTCCCGCCTCAAAGAAGAGATTAAGGAACTCAAGAAGCACGCTGCTAATACGTTCACTGAGAACTACACTCGCAATGAACAAGCAACTCTGCAATGGGCGAAGAGTTCCATGAGGGAACTTCGAATTGAAATGCGTGAACTAAGTCAAAGTATTAACAGCTCGGTACTGCTGCGACAATTGCAAAACATCCGCAACGAGTTAAAACGGGCATTGTCTGAGAACACGGACCTGGCTCAGTTAGCTCGTACTCAGGAGGCGCGAGTGGACAAATTGGATAGCGAAGTCGGCCGGCTCAAATACGACAGCCAAGAAATAAGAGGCATGGTCGCTGAGATACGCAGTCAAGTCGCGAAACTCTCAAAGGAGATTAAATTGAAGACACTCAATGAAGATAGCTTCAATGATGTCCTAGAGCCATACGAAAAACATGCTTCAGACTCGCATCCTAAGCATGGACACAAAATACGCCACAACAAAATGGTGCACGCCCAAATATCCCGTTTGGCCCGCAGCCAAAATCAGTTGGATGAATACCAACAGCACTTGCAGACTCAACTCCTCGATGTGCTTCGTCGCCTCGACCGCATTGAAGAAGCTAACTGGAATCTGGTTTCAACTCGAGTAGATTACCTTGCAACTGAAACTAACACAATCAAAAATGAACTGAACAATGTAACCCAACGAGTGGCCGACTTTGATAAAGTCCATGCCTCTATGCTTGAACTGCGCGAGGACGTTGAAAGCATTGAGAACAAAGCTGACAAAACAATTCCCGAGTTTAGAAAAGAGATATCTAAACTGGATCTTAGCTTCGCCCAGCTCAACGCTCAATCTTCTTATCTAAAAGAGGACCAAGAGAATCTCCGTCAATCTGTTAAGGCCATCGCAGTCAGCGTGAGCAACACCATTGATCGTGCCGAAATGGATCGTCTCGTTATCAAAGCTCTCAATGACTCTGTGATCGGTCTCGAAAATATAAGCAAGCAACACTACTACCGCCTTAACGATCACATTCTCAAGAGTGAAGCCAATAAGACAACAATGATTAGTCAATATATTCCGCTCCCTGAACTTATTGATGAAGTTAAAGAGCTTCAACCCCTGGAACGTGAGTATGAAAATCTGGTTGTTCAATTACCTAAGGATTGCTCAAGCGTGACTGGACCTGACCAAGTTTATTTAATAAACCCTGGCCATTCTCCGATTGAGACCTTTTGTACCAATGGAAGTACCCTTATTCAACGACGTTACAACGGATCCGTAGAATTTAATAGGAAATTTGCTCAATACGTGCAAGGTTTTGGTAACGCAGCCTCCGAATTTTGGCTTGGCTTGGAATCGATGCACCAATTGACCGCTGATAACTGCTCTTCTATGAGGATCGAGATGACCGATATTTATGGAAGCTCTTGGCATGCTGAATATGATCATTTCTCCGTTGGAAGCGCTGATACTGGATATGTTTTGACTGTGAGCGGTTTCAGAGGCAATGCTAGTGACGCTTTTGAGTACCAAAACCATATGGAATTTTCTGCCATCGACCACGACAGAGACATCTCGAATACTCATTGCGCTGCCAACTATGAAGGAGGTTGGTGGTTCTCTCATTGCCAGCACGTGAATATCAATGGCAAGTACACTCTTGGTTTGACCTGGTTTGACTCTCTAAGGAATGAGTGGATAGCGGTTGCAACCAGTGAGATGCGCCTATTCCGTAACAAACGCTGTACTTAA

Protein sequence:

>DPOGS203951-PA
MEFVKLWALIICLCSVNAREIDIKNELLSLTEQFKALKTVHLADVSRLKEEIKELKKHAANTFTENYTRNEQATLQWAKSSMRELRIEMRELSQSINSSVLLRQLQNIRNELKRALSENTDLAQLARTQEARVDKLDSEVGRLKYDSQEIRGMVAEIRSQVAKLSKEIKLKTLNEDSFNDVLEPYEKHASDSHPKHGHKIRHNKMVHAQISRLARSQNQLDEYQQHLQTQLLDVLRRLDRIEEANWNLVSTRVDYLATETNTIKNELNNVTQRVADFDKVHASMLELREDVESIENKADKTIPEFRKEISKLDLSFAQLNAQSSYLKEDQENLRQSVKAIAVSVSNTIDRAEMDRLVIKALNDSVIGLENISKQHYYRLNDHILKSEANKTTMISQYIPLPELIDEVKELQPLEREYENLVVQLPKDCSSVTGPDQVYLINPGHSPIETFCTNGSTLIQRRYNGSVEFNRKFAQYVQGFGNAASEFWLGLESMHQLTADNCSSMRIEMTDIYGSSWHAEYDHFSVGSADTGYVLTVSGFRGNASDAFEYQNHMEFSAIDHDRDISNTHCAANYEGGWWFSHCQHVNINGKYTLGLTWFDSLRNEWIAVATSEMRLFRNKRCT-
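
As a quick consistency check between the two sequences above, the start and end of the transcript can be translated with the standard genetic code. This is a minimal sketch: the `translate` helper is hypothetical, it assumes that the standard code (NCBI translation table 1) applies, and it writes the stop codon as `-` to mirror the trailing `-` in the protein record. Only the first 60 and last 21 nucleotides of DPOGS203951-TA are included to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides, stop_symbol="-"):
    """Translate an in-frame DNA string; stop codons become `stop_symbol`."""
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


# First 60 and last 21 nucleotides of DPOGS203951-TA, copied from above.
FIRST_60 = "ATGGAGTTCGTTAAACTTTGGGCGCTAATTATTTGTTTGTGTTCGGTGAACGCTCGCGAA"
LAST_21 = "CGTAACAAACGCTGTACTTAA"

print(translate(FIRST_60))  # MEFVKLWALIICLCSVNARE
print(translate(LAST_21))   # RNKRCT-
```

Both fragments agree with the corresponding ends of DPOGS203951-PA. The same function can be applied to the full transcript.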