Monarch geneset OGS2.0

DPOGS203331
Transcript: DPOGS203331-TA; 2493 bp
Protein: DPOGS203331-PA; 830 aa
Genomic position: DPSCF300003 - 342534-362366
RNAseq coverage: 81x (Rank: top 64%)
Annotation
Heliconius: HMEL007272; 0.0; 84.65%
Bombyx: BGIBMGA002021-TA; 0.0; 81.17%
Drosophila: CG9095-PB; 0.0; 57.20%
EBI UniRef50: UniRef50_D6WGL3; 0.0; 59.19%; Putative uncharacterized protein n=4 Tax=Neoptera RepID=D6WGL3_TRICA
NCBI RefSeq: XP_001948504.1; 0.0; 56.58%; PREDICTED: similar to CG9095 CG9095-PB [Acyrthosiphon pisum]
NCBI nr blastp: gi|328710409; 0.0; 58.98%; PREDICTED: hypothetical protein LOC100160092 [Acyrthosiphon pisum]
NCBI nr blastx: gi|270004842; 0.0; 61.62%; hypothetical protein TcasGA2_TC002984 [Tribolium castaneum]
Group
Gene Ontology: GO:0005488; 2.7e-28; binding
KEGG pathway: hsa:1378; 3e-19
    K04011 (CR1, CD35) maps->
        Leishmaniasis
        Malaria
        Complement and coagulation cascades
        Hematopoietic cell lineage
InterPro domain:
    [78-227] IPR006585; 1.6e-72; Fucolectin tachylectin-4 pentraxin-1
    [229-364] IPR016187; 2.7e-28; C-type lectin fold
    [71-226] IPR008979; 2.7e-27; Galactose-binding domain-like
    [217-366] IPR016186; 1.4e-23; C-type lectin-like
    [229-361] IPR001304; 3.1e-20; C-type lectin
    [421-480] IPR016060; 7.7e-16; Complement control module
    [425-480] IPR000436; 5.9e-13; Sushi/SCR/CCP
Orthology group: MCL16104 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203331-TA
ATGGATGTCCTCTGCGTTTGTGTTCTTCTCGCTTTAGCTGAAGGTTTGGCAGCTTCTACTTGCGGTTTCCCTGGAGCTCCTGCTCATTGCAATGTTGCATTTTCTGACGAGGGTCTCACGGAAGGCACCGTGGCCACATACGCTTGTGAGCGAGGGTTCGAACTCCTCGGACCGTCTAGAAGGCTCTGTGATAAAAACGGCAAATGGACCCCCGATGGGATACCATTCTGTGTATTAAATGTAGCGGCCGGCAAGGCTCCGATGCAGATATCGACGGAGGACGGCGGTGTACCACAGAGGGCCCTAGATGGCAGCACATCAGCGGCCTTTAACGCTGAAACTTGCACCCTCACTAAAACGGAACGGGTCCCATGGTGGTACGTAAACCTCCTCGAGCCATACATGGTGCAGCTCGTCCGCCTCGACTTTGGCAAGCCTTGCTGTGGTGTGAACAAACCCGCAGTCGTGGTCGTGAGAGTCGGTAATAATCGTCCGGATCTCGGGACGAATCCGATTTGCAACCGATTCACCGGTTTCCTAGAGGAAGGCCAGCCTTTGTTCCTGCCATGCAATCCGCCGATGCCCGGAGCTTTCGTCAGCGTTCACCTGGAAGGCTCGAGCCCCAGTCAGCTTTCTATTTGCGAGGCGTTCGTTTACACAGATCAAGCGCTGCCGATAGAAAGATGTCCTCAATTCAGAGATCAGCCTCCGGGCAGCACGGCTACATATAATGGCAAATGCTACATATTCTACGACCGTCAGCCAGCTGACTTCAGAGACTCGCTCGGATTCTGTCGCTCTAGAGGCGGCACTCTGGTGGACGAGAGCAATCCGGCGTTACAAGGATTCATCAGCTGGGAACTGTGGAGACGACACCGGAGCGACAGCAGCAGTCAGTATTGGATGGGAGCCGTCCGCGATCCCCAGGATCCTGGTAACTGGAAGTGGGTGAACGGTAAAGACGTCACCGTGTCTTTCTGGAACGCCCCGGGCGGGAGCGAGGGCTGCGCTAGGTTTGACGGTAGCAAGGGCTGGCTCTGGGCCGACACGGACTGCCAATCAAAACTAAACTATATCTGTCAACACCAGCCGAAAGCGTGTGGTCGGCCGGAACAGCCGCCGAATTCGACGATGACGACTGAGAGTTTCGATGTCGGAGCGACCGTTGAGTACGCGTGTGATGAAGGTCACCTGTTAGTGGGACCTACCGTCCGAACCTGCATGGACACGGGCTTCTACGACGAGTTCCCACCGGTCTGCAAAAGGATAGAATGCGGCTTCCCAGCGGATATATCCCACGGGGGCTACCAGCTCATAAATTCCTCTGTGTCGTACCTCTCCCACGTTCAGTACGAATGTGATGATGGTTATGAGATGGTGGGTCGCTCGAGACTCGTCTGTGACATCGACGAACGTTGGAACGGTCCGCCGCCCAGATGTGATGTGGTCCAATGCGAACAGCCACCTCAGGTTTTAAACAGCCGCGTGTCCATTACGAACAACGTGTCTGTGTTCGGAGCCTTCGCTGAATACACATGCGCTAAGGGCTACAAAATCCAAGGAGCGCGTAGAATGAAGTGTCTAGCCACAGGAGTTTGGGACAAACCGGCGCCACAATGTCTTCCCGAGGAGCGTCCCACAACTACTACTACAACAACTACGACTACAACAACAACAACTACAACCGAGATCCCTACAACAATCACCGTCCCTACAACACAGCTGCCAACCAGCCCTCGCTTGATATTACCGACGAGGAGACTCCCGAGGCCGTCCTCGCCTAATCCATTCGTGACACCTGAACCTGCTAAAACCAGGCCGAGGCCAAAGTTCACGACGACCGAAAGAGCAACAACGAAATCGTATGACAAACCCCCGCTAAGGAAAGTCACTGTTTCCGACTCCCAGGAGAGTCAATCGAGTCAAGTGCCTCACATCATCGTGGCCAGTCATCCGCGAGAGAACCAGGTCTTCGGAAACGGAAATAATATAAGGGCGGA
ACAGACTCCGCGTGTGAACATTCCTCAGCCGGTGGACGGTGAGAGAAGAGAGACCCTCGGAGCTCGCCTCAACATCGGCGCGGTGGTCGCCTTAGGAGCCTTCGGCGCCCTCGTCTTCCTCATAGCCATCATTACCACCATCGTCATATTAGTTAGGAGGAAACATAACGACGGGAAGCGCTACAGGCATCACGTGTCGCCGGACTGCAATACAGTGGCGTCGCTGGACTCGTCGTCGTCGGAGTCGAGGAGCGGGTTGAACAGGTACTATCGGCAGGCGTGGGAGGAGCTCCACGAGGCCACCGGCGCCAAGCACGGCCAGAGACGACACGAGCGAGAAGCCCCCAAAGACGGCTCCGAGCTGGTCGTGTCCGACGTGTACCCGGCGGAGCACAGAGACAAACGACGACACCACCACCACCACCACCGCGAGCGACACCCCGACTGGCAACCATCACACCGTCCCCACCACAAGCACAAACCTAGATATTAA

Protein sequence:

>DPOGS203331-PA
MDVLCVCVLLALAEGLAASTCGFPGAPAHCNVAFSDEGLTEGTVATYACERGFELLGPSRRLCDKNGKWTPDGIPFCVLNVAAGKAPMQISTEDGGVPQRALDGSTSAAFNAETCTLTKTERVPWWYVNLLEPYMVQLVRLDFGKPCCGVNKPAVVVVRVGNNRPDLGTNPICNRFTGFLEEGQPLFLPCNPPMPGAFVSVHLEGSSPSQLSICEAFVYTDQALPIERCPQFRDQPPGSTATYNGKCYIFYDRQPADFRDSLGFCRSRGGTLVDESNPALQGFISWELWRRHRSDSSSQYWMGAVRDPQDPGNWKWVNGKDVTVSFWNAPGGSEGCARFDGSKGWLWADTDCQSKLNYICQHQPKACGRPEQPPNSTMTTESFDVGATVEYACDEGHLLVGPTVRTCMDTGFYDEFPPVCKRIECGFPADISHGGYQLINSSVSYLSHVQYECDDGYEMVGRSRLVCDIDERWNGPPPRCDVVQCEQPPQVLNSRVSITNNVSVFGAFAEYTCAKGYKIQGARRMKCLATGVWDKPAPQCLPEERPTTTTTTTTTTTTTTTTEIPTTITVPTTQLPTSPRLILPTRRLPRPSSPNPFVTPEPAKTRPRPKFTTTERATTKSYDKPPLRKVTVSDSQESQSSQVPHIIVASHPRENQVFGNGNNIRAEQTPRVNIPQPVDGERRETLGARLNIGAVVALGAFGALVFLIAIITTIVILVRRKHNDGKRYRHHVSPDCNTVASLDSSSSESRSGLNRYYRQAWEELHEATGAKHGQRRHEREAPKDGSELVVSDVYPAEHRDKRRHHHHHHRERHPDWQPSHRPHHKHKPRY-