Monarch geneset OGS2.0

DPOGS213005
TranscriptDPOGS213005-TA1692 bp
ProteinDPOGS213005-PA563 aa
Genomic positionDPSCF300024 - 203935-207434
RNAseq coverage438x (Rank: top 28%)
Annotation
HeliconiusHMEL0122980.086.49% 
BombyxBGIBMGA006913-TA0.076.79% 
Drosophilacv-2-PA1e-11644.00% 
EBI UniRef50UniRef50_Q7Q5W45e-12143.22%AGAP006168-PA n=5 Tax=Culicidae RepID=Q7Q5W4_ANOGA
NCBI RefSeqXP_001659315.12e-12345.96%crossveinless [Aedes aegypti]
NCBI nr blastpgi|3454872065e-12446.86%PREDICTED: BMP-binding endothelial regulator protein-like [Nasonia vitripennis]
NCBI nr blastxgi|3454872065e-13946.45%PREDICTED: BMP-binding endothelial regulator protein-like [Nasonia vitripennis]
Group
Gene OntologyGO:00055151.4e-09protein binding
KEGG pathwaymmu:223711e-33 
 K03900 (VWF)maps-> Complement and coagulation cascades
    Focal adhesion
    ECM-receptor interaction
InterPro domain[242-404] IPR0018462.5e-44von Willebrand factor, type D domain
[445-516] IPR0148532.7e-12Uncharacterised domain, cysteine-rich
[192-249] IPR0010071.4e-09von Willebrand factor, type C
Orthology groupMCL10794 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213005-TA
ATGGTCGGAAAGTCTCCCACGGCGTTTGCGTCTCTATTATTGACCATAAATCAAAAAGTTAACAACCTACTTAAAAGGTTGAGTCGGTTGGTTTTTGTGTGGCTCAAAGCGACCGACGTCTGGCTGGCTGGCGGGAACCGAGAATGTATCATAAATGGCCAAAGAGTTTGGGAAGGCCGCGATGTTCGGATACCAGAAGAGCCATGTTTATCGTGTCGATGTATGCGGGGTGCCTTATCCTGTACAAAGAAAGCTTGTCCAGTGCTTCCGTGTGTTATTTCGCAGCAGTATACTCCAATTGGGGAATGCTGTCCCCGATGTTCTCATCCAACAGCTGATAAAATTCTTCCAATTTCGGGATTTGCTCAGAAACCGTGTATAATTGGAAAGGAATATCACCTCAATTCGATCCCATTTCGAGTGGATCCTTGCACGGACTGTATGTGCATGAATGGTACGGCGGTGTGCTCGCGTCACACATGCCCAGTGCTAACATGCGGAGCCCGTGCTTTACCACCGGCCCCCGGGAAATGCTGTCCCGAATGTCCACAAATTGAAGAAGCGAAAGAAGCATGTGTCATTGGGGGAAAAAAATATCAGGACGGAGAAACGTGGCAATTAGATGCTTGTAAATCCTGCGAGTGTCATGGAGGAGAACCACGTTGTGCCATGGAGCGATGCCCAGTATTAAGCTGCTCGCCGGATCAGATTCTTCGTCAGTTGCCGGGACAGTGCTGTTCAAAATGCGTCGATATAGACGGAATATGCACTGTATTTGGAGATCCACACTATAAAACATTCGATGGAAAATTTTATAGTTTTCAAGGATCTTGCAAATATCAATTGGTATCCGATTGCAAAAATCACACATTTTCAATCAGAATATCCAATGATTCCAGAAATACATCACATTCCTCGTGGACACGTACTGCCACCCTTCGCATAGGGTCAACAAAAATCAATATGGGTAAAAAGATGCGTATCAAAGTTAATGGAAAGAGAATAAGTCTTCCACATATAGTAAAAGGAATAGCTGAGATAGATCGCAGTAACGGATCGGTATTACTAAAATCCGATATTGGAGTGCAAATGTTGTGGGATGGTGATGGATTTTTGGAAGTTACTGTTTCAAGTTCGTATAAAGGCAAGCTCTGTGGTCTATGTGGTAATTTTAATTCAGTAGCCAGGGATGACATGAGAACTCGTGATGGAAGACTATTAAATGATACTTGGAAGTTCGGATCGTCTTGGCGCGTGGGAGGTCATCGAGCTTGTACAAGACGACAAGAAAGACCTTATGGAATTTCGAGATGTCGGCAGTCGAAACTTACTAAAGTCAGGCGTCTATGTCGAGCATTTGATCGCCATGATGCGTTCTCAGCTTGTAGTGCCAAAGTAAATCCCCACAATTATAAGGAAGCGTGTCTCCTAGATGCGTGTAGTTGTACGGGTGTTCGGTGTCATTGTGCTGCATATAGAGCTTATGCTAGAGAATGTTCACGTGTGGGAGCGGAACCTCAAAATTGGCTTCGCGCTGCATGGTGTGAGGGTCCCCCACCTCCTTGGCTCAATCGAAGCCGTAAAGGCTTTGGTCGTTCCACGAAGCCCAAAGATCAGAATTTCTTAGACGTGGGTCTCTTACCGAAGCGAAATAATAGTCGTTCCCGGCCTCCACCTCCCATTCTACATTAA

Protein sequence:

>DPOGS213005-PA
MVGKSPTAFASLLLTINQKVNNLLKRLSRLVFVWLKATDVWLAGGNRECIINGQRVWEGRDVRIPEEPCLSCRCMRGALSCTKKACPVLPCVISQQYTPIGECCPRCSHPTADKILPISGFAQKPCIIGKEYHLNSIPFRVDPCTDCMCMNGTAVCSRHTCPVLTCGARALPPAPGKCCPECPQIEEAKEACVIGGKKYQDGETWQLDACKSCECHGGEPRCAMERCPVLSCSPDQILRQLPGQCCSKCVDIDGICTVFGDPHYKTFDGKFYSFQGSCKYQLVSDCKNHTFSIRISNDSRNTSHSSWTRTATLRIGSTKINMGKKMRIKVNGKRISLPHIVKGIAEIDRSNGSVLLKSDIGVQMLWDGDGFLEVTVSSSYKGKLCGLCGNFNSVARDDMRTRDGRLLNDTWKFGSSWRVGGHRACTRRQERPYGISRCRQSKLTKVRRLCRAFDRHDAFSACSAKVNPHNYKEACLLDACSCTGVRCHCAAYRAYARECSRVGAEPQNWLRAAWCEGPPPPWLNRSRKGFGRSTKPKDQNFLDVGLLPKRNNSRSRPPPPILH-