Monarch geneset OGS2.0

DPOGS214451
TranscriptDPOGS214451-TA1050 bp
ProteinDPOGS214451-PA349 aa
Genomic positionDPSCF300441 - 28913-31530
RNAseq coverage2235x (Rank: top 5%)
Annotation
HeliconiusHMEL0077932e-5369.87% 
BombyxBGIBMGA009619-TA2e-15472.77% 
DrosophilaSocs36E-PB5e-7675.45% 
EBI UniRef50UniRef50_E3XBE51e-7774.58%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3XBE5_ANODA
NCBI RefSeqXP_001605682.11e-7667.49%PREDICTED: similar to suppressors of cytokine signalling [Nasonia vitripennis]
NCBI nr blastpgi|3123728864e-7774.58%hypothetical protein AND_19522 [Anopheles darlingi]
NCBI nr blastxgi|1565447129e-7467.49%PREDICTED: hypothetical protein LOC100122078 [Nasonia vitripennis]
Group
Gene OntologyGO:00055155.7e-33protein binding
GO:00355564.9e-10intracellular signal transduction
KEGG pathwaynvi:1001220784e-76 
 K04700 (SOCSN)maps-> Jak-STAT signaling pathway
InterPro domain[177-301] IPR0009805.7e-33SH2 motif
[288-331] IPR0014964.9e-10SOCS protein, C-terminal
Orthology groupMCL13892 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214451-TA
ATGGGACAGCAGACGTCGAGGAAGAGCGGGGAGTGTAGCTGCGGCTGCGGCGCCTGGGAGAGACGGAGAGAGGGAGACTCGCCCAGCAGTGTTCATAGATACGTGAGTGCAGTCACGGACAGATGGAGCGGGCGAGAGTGCGCGTGTCGCCGGAGACGGTGGAGGCGCCCGGCCTGTGTGTGCACCGCCTACAGACGAGTCAGCGACGCCTGCCACGACGATAGACTAGCGGCCGTCCTTACACTGGGAGCGAGAGATCTCAGGAGAGAGCTGGACGCCATAGTCATTAACACAGACGGTGACACTGGCAGAGACCATGGGGAGCCCACAGCTGAGGTCTATGTCCTGTCCGTCGGTCCGAGGAGTGACACAGACTCCACCCCCGAGGGTCGAGCTACTGAGCTGGTGCAGGCCAGCGATCAGTCTATAAGGAGGTTCCAGGTGGTGTGTGGCGGCGAGCTCCGCGCGCTTCTCCTCCGCTGCCCGCTCCCGCCCGCCCTCGTGCCGCCCACTGTACACACACAGGTCGACTACAAACACTGTCTAGTGCCGGACCTGCAGGAGATCACGGCTTGCTCGTTCTACTGGGGGAAGATGGACCGCTATGAGGCGGAGAGACTCCTGGAAAACAAGCCCGAGGGCACGTTCCTGCTCCGCGACTCGGCCCAGGAGGAGCACCTGTTCTCGGTGTCGTTCCGTAAGTACGGCCGGTCGCTCCACGCCCGCATCGAGCACTACCAACATCGGTTCAGCTTCGACTCTCACGACCCCGGGGTGTTCGCCGCCCCCACGGTCACCGGTCTCATAGAACACTACAAGGACCCGGCCTGCGTGATGTTCTTCGAGCCGATGCTGACGGCTCCTCTGCCGCGCAGCTCTCCTTTCTCTCTGCAGCAGTTGTCGCGGGCGGTGATCGTCTCTCACGTGAGCTACGACGGCGTGGAACACCTCCCGCTGCCGGCCCGCCTGAGGGCCTTCCTCAAGGAGTACCATTACCGGCAACGCGTCCGCGTCCGCCGCCTGGAGAGCGACTCGTACGAGCGAGCCTAG

Protein sequence:

>DPOGS214451-PA
MGQQTSRKSGECSCGCGAWERRREGDSPSSVHRYVSAVTDRWSGRECACRRRRWRRPACVCTAYRRVSDACHDDRLAAVLTLGARDLRRELDAIVINTDGDTGRDHGEPTAEVYVLSVGPRSDTDSTPEGRATELVQASDQSIRRFQVVCGGELRALLLRCPLPPALVPPTVHTQVDYKHCLVPDLQEITACSFYWGKMDRYEAERLLENKPEGTFLLRDSAQEEHLFSVSFRKYGRSLHARIEHYQHRFSFDSHDPGVFAAPTVTGLIEHYKDPACVMFFEPMLTAPLPRSSPFSLQQLSRAVIVSHVSYDGVEHLPLPARLRAFLKEYHYRQRVRVRRLESDSYERA-