Monarch geneset OGS2.0

DPOGS203317
TranscriptDPOGS203317-TA1926 bp
ProteinDPOGS203317-PA641 aa
Genomic positionDPSCF300003 - 894313-899641
RNAseq coverage113x (Rank: top 59%)
Annotation
HeliconiusHMEL0166280.084.79% 
BombyxBGIBMGA012306-TA0.086.00% 
DrosophilaCG41520-PA0.053.82% 
EBI UniRef50UniRef50_B4J6F80.063.69%GH21726 n=10 Tax=Coelomata RepID=B4J6F8_DROGR
NCBI RefSeqXP_973184.10.069.73%PREDICTED: similar to CG41520 CG41520-PA [Tribolium castaneum]
NCBI nr blastpgi|3072021900.067.47%Techylectin-5B [Harpegnathos saltator]
NCBI nr blastxgi|910800490.069.97%PREDICTED: similar to CG41520 CG41520-PA [Tribolium castaneum]
Group
Gene OntologyGO:00071651.6e-84signal transduction
GO:00051021.6e-84receptor binding
KEGG pathway 
InterPro domain[414-635] IPR0021811.6e-84Fibrinogen, alpha/beta/gamma chain, C-terminal globular
[418-558] IPR0147161e-48Fibrinogen, alpha/beta/gamma chain, C-terminal globular, subdomain 1
[559-626] IPR0147157e-19Fibrinogen, alpha/beta/gamma chain, C-terminal globular, subdomain 2
Orthology groupMCL16571 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203317-TA
ATGAGGAGAGTCGAAGCGTTGGACTCGCGAGTAAATGATAATATTCATAAAACTGACGCCATCATATCCAAATTAGGTAATCTCGATGTAAAACTTTTCAGTAGAATCTATCAAGGCGATGATGAAAGTTCTACTGAAAACATCAACAATAAAAAGAACGAAATGAGCGATGCGAAACTTTTGGATAAAAAACTGGAATCTTTAGATCAAAAAGTTTCAGGCATTGACACCAAATTAATAGGACTCAAAACACAAATCGATAACAACTTTCTGCCAGTCGATGACATCAATGCAGAAGCGAGTGAAAAGAAACCTATAAATTTAAACGTGATTGAAATCGCAAAGAGCCTGAATACTGAAGTTATCAATGAAATAACAAAAGAAGTCGATCAGTTGAGAACTTCCATGTCGACTGTTGATAGGAAGTTACAATTTCATATAAATCTTGTGTCGGAAAATTTAGGAAAAGTGCTTTACATGATGGCGGACGTCCACGCTGCTATCGTCGAGCCAGAATCTGCCTCCATCAACATAAACGGTCTCTTTAAAAACAGAACAACCACGAGCACCCCATCACCAGCAACTAAAGCAAGTAAAATAGATACTCTTGTGAATAAAATTATTCCTATGATGAGCGTTTCTGAGAAAATGGACGAGGTGTGGGACGTCGTCGTTGGAACTAAGAGTTCAGTCGACGATTTAGTACCAAAATCTGATGAACTCCTGACACAAACACAGAGACAGGAACGTGCTATAGGCCAGATACACAATGATTTAAGACTCAAAACAAATCTAATAATAGAAAATCTGGACATGGTTGAAAAGCGTTTGAAGAAACAAGAAGACGATGTAGCAACTTTGGCGCAACGTCCAGTGCCGGCTGAACTTCTCCTAGATCCGACGATAGACAGGCTTGTGGAGTACGCTCCAAACAGATACAAAGTCGACGAACTCTTGACGGAGCCGACTACCAATGCACCGGTCACAACACAAGCCTCATCGACTGCTAACATAAGCCCGAGCAGTCCGAGCAATGTGAGTGCGAGTTCAACAAGTGCCACCGTTACGGTGACACCGGCGGCCGCAGGCAGTGGAGGCAGTGCCGGCAATAGCGGCGCTGGCCCTACCAGCTCTCGACCTTCCAGCCGTAAAGGCGGCATAATCTTCCCCAGCGTTAAGAACAAGCCCATCATAGGCAACAACACCTTCGCATCAGAGATCGTCGCCAACTATAAAGATGTTAAAGGCTACTCTTGCGTGGATCTTCTGAATGCAGGCATGCGTGAGTCCGGAGTCTACTACCTTCAGATACGAGGAACTACTTACTGGTTCCTCAAAGTATTCTGTGAGCAGAACGTCGCCGACGGTGGTTGGACGGTGATTCATCGTCGCGATGACTTCGGAATACCAGCGGAGAACTTCAATCGGGACTGGAGCGACTATAAAAATGGCTTCGGTGATCCAAACAAGGAGTTCTGGCTCGGAAACGAGAACATTTACATGCTAACCAACAATGACGACTACATGCTCCGAGTGGAACTGGAAGATTTCGATGGCAATAAGAGATACGCTCAGTATTCGCACTTCAAAATATACTCCGAAGCGGAATACTACAAGTTAGAGATAGACGGCTACGACGGCAACGCTGGTGATTCCCTGAACGACCCGTGGTATGGATCTAACAACAGTCCATTCTCTACTTACAATAGAGACAATGACAGGTCATCGTTGAATTGTGCGTCCATGTTGAAAGGAGGCTGGTGGTGGAAGTCCTGCGGACGAGGTCTCAACGGACTGTACCTTCACGACCCCCAGGACCTCACAGCCAGGCAAGGTATCGTCTGGTTCCGTTGGCGCGGCTGGGACTACACTCTTAAACGCGCCTCAATGATGATCAAGCCCAAGGGACTTCTACCGAACACATGA

Protein sequence:

>DPOGS203317-PA
MRRVEALDSRVNDNIHKTDAIISKLGNLDVKLFSRIYQGDDESSTENINNKKNEMSDAKLLDKKLESLDQKVSGIDTKLIGLKTQIDNNFLPVDDINAEASEKKPINLNVIEIAKSLNTEVINEITKEVDQLRTSMSTVDRKLQFHINLVSENLGKVLYMMADVHAAIVEPESASININGLFKNRTTTSTPSPATKASKIDTLVNKIIPMMSVSEKMDEVWDVVVGTKSSVDDLVPKSDELLTQTQRQERAIGQIHNDLRLKTNLIIENLDMVEKRLKKQEDDVATLAQRPVPAELLLDPTIDRLVEYAPNRYKVDELLTEPTTNAPVTTQASSTANISPSSPSNVSASSTSATVTVTPAAAGSGGSAGNSGAGPTSSRPSSRKGGIIFPSVKNKPIIGNNTFASEIVANYKDVKGYSCVDLLNAGMRESGVYYLQIRGTTYWFLKVFCEQNVADGGWTVIHRRDDFGIPAENFNRDWSDYKNGFGDPNKEFWLGNENIYMLTNNDDYMLRVELEDFDGNKRYAQYSHFKIYSEAEYYKLEIDGYDGNAGDSLNDPWYGSNNSPFSTYNRDNDRSSLNCASMLKGGWWWKSCGRGLNGLYLHDPQDLTARQGIVWFRWRGWDYTLKRASMMIKPKGLLPNT-