Monarch geneset OGS2.0

DPOGS200438
Transcript: DPOGS200438-TA (996 bp)
Protein: DPOGS200438-PA (331 aa)
Genomic position: DPSCF300236 + 386040-389788
RNAseq coverage: 1887x (Rank: top 7%)
Annotation
Heliconius | HMEL017176 | 5e-157 | 89.52%
Bombyx | BGIBMGA008896-TA | 2e-160 | 93.90%
Drosophila | gus-PD | 1e-122 | 68.84%
EBI UniRef50 | UniRef50_Q96A44 | 9e-101 | 66.91% | SPRY domain-containing SOCS box protein 4 n=109 Tax=Eumetazoa RepID=SPSB4_HUMAN
NCBI RefSeq | XP_002430723.1 | 4e-123 | 73.12% | SPRY domain-containing SOCS box protein, putative [Pediculus humanus corporis]
NCBI nr blastp | gi|328717959 | 6e-123 | 72.92% | PREDICTED: SPRY domain-containing SOCS box protein 1-like [Acyrthosiphon pisum]
NCBI nr blastx | gi|270011657 | 1e-123 | 67.87% | hypothetical protein TcasGA2_TC005709 [Tribolium castaneum]
Group
Gene Ontology:
  GO:0005515 | 2.4e-18 | protein binding
  GO:0035556 | 3.5e-10 | intracellular signal transduction
KEGG pathway 
InterPro domain:
  [84-302] | IPR008985 | 8.7e-64 | Concanavalin A-like lectin/glucanase
  [148-286] | IPR018355 | 3.6e-22 | SPla/RYanodine receptor subgroup
  [149-280] | IPR003877 | 2.4e-18 | SPla/RYanodine receptor SPRY
  [289-329] | IPR001496 | 3.5e-10 | SOCS protein, C-terminal
Orthology group: MCL11675 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200438-TA
ATGAAAAAAACAAACATTAAAGTTTCGGTAGATGCGGTCATGTACAAGAGCGGGGTGCGCTCAGTGAGGGGTGGCGGCGGGGGCGGCGAGGAGTCGCGTCGCGGCGGCCTCAAGGCACTGCCGAAGCGCGCCCGGACACGCTGCAGGGGTATGAACATGGGTCAAAAGCTATCGGGCGGTGTGAAGTCGGTGTCGCGCGAGTGCACGGCGCCGTTCAAGGCGGTGGTTGCACGCGAGCTGGCACCAGAGCTGCCGAGACCTGCGCGTCTTGATGCGCTCCTGGACGCGCCGCCAGCGCATCACGACACGCAGCTCAAGCACGCCTGGAACCCAGATGACCGGTCGTTGAATATATTCGTGAAGGAGGAGGACGCGTTGACGTTCCATAGACATCCCGTAGCCCAGTCGACGGATTGCATCCGTGGGCGCGTCGGGTACTCACGTGGGCTGCACTGCTGGGAGGTTGTGTGGCCGGCGAGGCAGAGGGGAACACACGCGGTCGTGGGCGTAGCGACCGCCCACGCGCCCTTACACTCCGTGGGCTATCAGAGCCTCGTGGGCGCCACCGATCAGAGTTGGGGCTGGGATCTTGGCAGGAATAAGGTGTACCACAACGCTAAGGGTACAGGCAGTAGCGGCTGTACGTATCCAGCGCTACTGCGACCAGATGAACAGTTCCTCGTGCCGGACCGACTGCTGGTAGTTCTGGATATGGACGAGGGCACGTTGTCCTTCTGCGCCGATGGAAGATACCTCGGCGTGGCCTTCAGGGGACTCAGAGGGAAAACTCTATACCCCATAGTGTCCGCGGTTTGGGGCCACGCTGAGATCACCATGAAATACATCGGCGGACTTGATCCCGAGCCGCTTCCCCTGATGGAGTTGTGTCGTCGTGTGATCCGCCAGCGCGTGGGTCGCGGCCGCCTCCGCTCGGCCGCGTCCCGCCTCGCCCTGCCGCCCGCCCTCTCCGCCTACCTTCTGTACCGCGCGCCCTAA

Protein sequence:

>DPOGS200438-PA
MKKTNIKVSVDAVMYKSGVRSVRGGGGGGEESRRGGLKALPKRARTRCRGMNMGQKLSGGVKSVSRECTAPFKAVVARELAPELPRPARLDALLDAPPAHHDTQLKHAWNPDDRSLNIFVKEEDALTFHRHPVAQSTDCIRGRVGYSRGLHCWEVVWPARQRGTHAVVGVATAHAPLHSVGYQSLVGATDQSWGWDLGRNKVYHNAKGTGSSGCTYPALLRPDEQFLVPDRLLVVLDMDEGTLSFCADGRYLGVAFRGLRGKTLYPIVSAVWGHAEITMKYIGGLDPEPLPLMELCRRVIRQRVGRGRLRSAASRLALPPALSAYLLYRAP-
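
The lengths reported for this record are consistent with each other: a 996 bp transcript is 332 codons, i.e. 331 amino acids plus one stop codon. The sketch below is only an illustrative check, not part of the database. It assumes the transcript is a complete coding sequence read in frame from its first base and translated with the standard genetic code. The names `translate`, `CDS_START`, and `CDS_END` are hypothetical, and only the first 30 and last 12 bases of DPOGS200438-TA are used. The stop codon is printed as `*` here, whereas the record writes it as `-`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered by T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Lengths as stated in the record header.
TRANSCRIPT_BP = 996
PROTEIN_AA = 331
lengths_consistent = TRANSCRIPT_BP == 3 * (PROTEIN_AA + 1)  # +1 for the stop codon

# First 30 and last 12 bases of DPOGS200438-TA, copied from the FASTA entry.
CDS_START = "ATGAAAAAAACAAACATTAAAGTTTCGGTA"
CDS_END = "CGCGCGCCCTAA"

print(lengths_consistent)
print(translate(CDS_START))
print(translate(CDS_END))
```

The two translated fragments can be compared by eye with the start (`MKKTNIKVSV`) and end (`RAP-`) of DPOGS200438-PA.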