Monarch geneset OGS2.0

DPOGS207660
Transcript: DPOGS207660-TA, 2043 bp
Protein: DPOGS207660-PA, 680 aa
Genomic position: DPSCF300133 - 74887-88712
RNAseq coverage: 7x (Rank: top 86%)
Annotation
Heliconius: HMEL017024  3e-174  64.55%
Bombyx: BGIBMGA006692-TA  3e-120  62.33%
Drosophila: Hml-PA  9e-62  32.18%
EBI UniRef50: UniRef50_P98092  7e-174  62.55%  Hemocytin n=1 Tax=Bombyx mori RepID=HMCT_BOMMO
NCBI RefSeq: NP_001104817.1  1e-174  62.55%  hemocytin precursor [Bombyx mori]
NCBI nr blastp: gi|162462371  2e-173  62.55%  hemocytin [Bombyx mori]
NCBI nr blastx: gi|162462371  0.0  62.55%  hemocytin [Bombyx mori]
Group:
Gene Ontology: GO:0005515  4.4e-11  protein binding
KEGG pathway: dpo:Dpse_GA20020  5e-60
    K03900 (VWF) maps to:
        Complement and coagulation cascades
        Focal adhesion
        ECM-receptor interaction
InterPro domain:
    [344-404]  IPR002919  1e-16  Protease inhibitor I8, cysteine-rich trypsin inhibitor-like
    [280-345]  IPR014853  3.1e-15  Uncharacterised domain, cysteine-rich
    [404-472]  IPR001007  4.4e-11  von Willebrand factor, type C
    [4-89]  IPR001846  2e-07  von Willebrand factor, type D domain
Orthology group: MCL10153  Multiple-copy universal gene
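
The InterPro hits listed above are residue ranges on the 680 aa protein, and two adjacent pairs appear to share boundary residues. The following is a minimal Python sketch for listing domain lengths and overlapping pairs. It assumes the coordinates are 1-based and inclusive, which the record does not state; the names DOMAINS, domain_length and overlapping_pairs are hypothetical helpers introduced only for illustration.

```python
# Domain hits copied from the record: (start, end, InterPro ID).
# Assumption: coordinates are 1-based and inclusive.
DOMAINS = [
    (344, 404, "IPR002919"),
    (280, 345, "IPR014853"),
    (404, 472, "IPR001007"),
    (4, 89, "IPR001846"),
]


def domain_length(start, end):
    """Number of residues in an inclusive range."""
    return end - start + 1


def overlapping_pairs(domains):
    """Return ID pairs of domains that share at least one residue."""
    ordered = sorted(domains)  # sort by start position
    pairs = []
    for i, (_start1, end1, id1) in enumerate(ordered):
        for start2, _end2, id2 in ordered[i + 1:]:
            # start2 >= start1 after sorting, so overlap means start2 <= end1
            if start2 <= end1:
                pairs.append((id1, id2))
    return pairs


for start, end, ipr in sorted(DOMAINS):
    print(f"{ipr}: residues {start}-{end}, {domain_length(start, end)} aa")
print(overlapping_pairs(DOMAINS))
```

Under the inclusive-coordinate assumption, IPR014853 and IPR002919 share residues 344-345, and IPR002919 and IPR001007 share residue 404.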

Nucleotide sequence:

>DPOGS207660-TA
ATGATACGCAATGATCCGCAGCTGATAACAACAGTGAACGGCGAGCGAGTGTTCAAGTATCCTTTCAGCAGCGACTGGGCGGTCATCACCCTCGTCAATGGACAAGACGTTAACGTTTTACTTCCGGAGCTGCATGTTGAGGTGATGGTGTTGCAGTCTAAACTTCAGTTCACTGTGAGCGTCCCGTCACACGACTACAGCAACCGCACGGAAGGTCTGTGTGGAGTGTGCGCCGGGTACCAGGACCAGCTCATCACCAGCAATGGAACCGTCACCGACGACTTTGAATTGTACGGCAAGAGTTGGCAGGCGAGCCCGGAAGTACTGACGAAACTGGAAGTGCCACCTCAGGAGCAGTGCGGCGACATCCCACCGCCACCACCCTGTGTGCCTCCTCCACCGGAAAGCAATCCGTGTTACAACCTAAACAACGTAGAGAAGTTTGGAGCTCTGATAACAACAGTGAACGGCGAGCGAGTGTTCAAGTATCCTTTCAGCAGCGACTGGGCGGTCATCACCCTCGTCAATGGACAAGACGTTAACGTTTTACTTCCGGAGCTGCATGTTGAGGTGATGGTGTTGCAGTCTAAACTTCAGTTCACTGTGAGCGTCCCGTCACACGACTACAGCAACCGCACGGAAGGTCTGTGTGGAGTGTGCGCCGGGTACCAGGACCAGCTCATCACCAGCAATGGAACCGTCACCGACGACTTTGAATTGTACGGCAAGAGTTGGCAGGCGAGCCCGGAAGTACTGACGAAACTGGAAGTGCCACCTCAGGAGCAGTGCGGCGACATCCCACCGCCACCACCCTGTGTGCCTCCTCCACCGGAAAGCAATCCGTGTTACAACCTAAACAACGTAGAGAAGTTTGGAGCTTGCCACGCGCTTGTAGAGCCTCAGTCCTACATAGAGCAGTGCGAGTCAGAGCTCTGCGAGTTGAACTCGACTGACGCTTGTCCGGTGCTGGAGCGGTACGCGGCCGAGTGTCGCAAACAGGGTGTTTGCCTCGACTGGAGAAGCGATCTATGTCCATACCCATGCGACGAGCCGCTCGTATACAGAAAGTGCGTGGACTGTGAGAGAACTTGTGAAAATTACGAAGAACTGAAGGACAATCCAAAACTCTGCGATAAACAACCCGTCGAAGGATGCTTCTGTCCGGAAGGAAAGGTGAGAGTGAACAACACGTGTATCGAACCGAGCAAGTGCTTCCCGTGTGATACAAAGAAGGAACACTACGCCGGGGACGAGTGGCAAGAAGACGCGTGCACTCATTGCACGTGCAGTAAGTCGGGCGAGAGCGCACACGTGTCGTGTACAACGCGTACGTGCGCGCCGCCCGTGTGCGCTGACGGAGAGGAACGTGTGCCGGCCCCCACACCACCAGGAGCCTGTTGCAAGGAGTACCTGTGCGTTCCTAAACCGCCGGACGTGGTCTGCGACGAACCGAAGAAGATGGAATGCGGGTTCGGACAAGTTTTGAAACTGAAGAGCAAACCCGATGGATGTTCAGAATTCGTCTGCGAATGCAAGCCGGAAAGCGAATGTGAACCTCTTCCTGATGAGAGTGAAGTGGAGATGTTGGAGCCGGGGATGGAGCGCGTCGTGGACCGCTCGGGATGTTGTCCGCGAGCCTCGCTCCACTGCCGCCCCGAGGCCTGCCCCGCGGCCCCCGACTGTCCCGCACTACATAACCTACGTACTACCAATGTCACGGGCCAGTGTTGTCCCGAACACAAGTGCGAACTGCCCAAGGACAAATGCTTTGTGACTCTGGAGTGGGAGGCTGCGCCAAAAGGAGGAGAGAAGGCTCGTCCGACGCCACAGGTTATGTTGAAAGATTTGGATTCAGCCTGGCTGGACGGACCGTGCCGCTCGTGTCGCTGCGAATCAACGGCCGCGGGTCCGTCCCCCCAGTGTCACGTGACCTCTTGCCCCACTTTGATTGACATTTATTGTAGGAGGAAATATCGAATTGATCTACAGAACAGTACGCA
AAAGAAGACATTAATTGACAAAAAGTCGGTTTTTAGCAATTGA

Protein sequence:

>DPOGS207660-PA
MIRNDPQLITTVNGERVFKYPFSSDWAVITLVNGQDVNVLLPELHVEVMVLQSKLQFTVSVPSHDYSNRTEGLCGVCAGYQDQLITSNGTVTDDFELYGKSWQASPEVLTKLEVPPQEQCGDIPPPPPCVPPPPESNPCYNLNNVEKFGALITTVNGERVFKYPFSSDWAVITLVNGQDVNVLLPELHVEVMVLQSKLQFTVSVPSHDYSNRTEGLCGVCAGYQDQLITSNGTVTDDFELYGKSWQASPEVLTKLEVPPQEQCGDIPPPPPCVPPPPESNPCYNLNNVEKFGACHALVEPQSYIEQCESELCELNSTDACPVLERYAAECRKQGVCLDWRSDLCPYPCDEPLVYRKCVDCERTCENYEELKDNPKLCDKQPVEGCFCPEGKVRVNNTCIEPSKCFPCDTKKEHYAGDEWQEDACTHCTCSKSGESAHVSCTTRTCAPPVCADGEERVPAPTPPGACCKEYLCVPKPPDVVCDEPKKMECGFGQVLKLKSKPDGCSEFVCECKPESECEPLPDESEVEMLEPGMERVVDRSGCCPRASLHCRPEACPAAPDCPALHNLRTTNVTGQCCPEHKCELPKDKCFVTLEWEAAPKGGEKARPTPQVMLKDLDSAWLDGPCRSCRCESTAAGPSPQCHVTSCPTLIDIYCRRKYRIDLQNSTQKKTLIDKKSVFSN-
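
The stated lengths can be cross-checked: the nucleotide sequence ends in TGA and the protein sequence ends in "-", so a 680 aa protein plus one stop codon should correspond to 3 x 681 = 2043 bp, matching the transcript length given above. The following is a minimal Python sketch of that check. It assumes the transcript record is the coding sequence only (no UTRs) and that the trailing "-" marks the stop codon; expected_cds_length and fasta_sequence are hypothetical helper names.

```python
def expected_cds_length(protein_aa, has_stop_codon=True):
    """Coding-sequence length in bp for a protein of protein_aa residues."""
    codons = protein_aa + (1 if has_stop_codon else 0)
    return 3 * codons


def fasta_sequence(text):
    """Join the sequence lines of a single-record FASTA string.

    The '>' header line and blank lines are skipped.
    """
    return "".join(
        line.strip()
        for line in text.splitlines()
        if line.strip() and not line.startswith(">")
    )


# Lengths stated in the record: 680 aa protein, 2043 bp transcript.
print(expected_cds_length(680) == 2043)

# Toy FASTA input (not the real sequence) showing how wrapped lines are joined.
toy = ">example\nATGAAA\nTGA\n"
print(len(fasta_sequence(toy)))
```

To run the same check on the real record, pass the text of each FASTA block to fasta_sequence, strip the trailing "-" from the protein, and compare the two lengths with expected_cds_length.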