Monarch geneset OGS2.0

DPOGS214325
TranscriptDPOGS214325-TA1821 bp
ProteinDPOGS214325-PA606 aa
Genomic positionDPSCF300020 - 633597-645676
RNAseq coverage430x (Rank: top 28%)
Annotation
HeliconiusHMEL0056340.080.07% 
BombyxBGIBMGA003997-TA0.078.60% 
DrosophilaSu(var)2-10-PI2e-12364.63% 
EBI UniRef50UniRef50_D6WTG26e-12958.43%Putative uncharacterized protein n=4 Tax=Neoptera RepID=D6WTG2_TRICA
NCBI RefSeqXP_001647820.11e-12962.22%sumo ligase [Aedes aegypti]
NCBI nr blastpgi|1571421132e-12862.22%sumo ligase [Aedes aegypti]
NCBI nr blastxgi|3852591168e-13879.68%PIAS2 protein [Bombyx mori]
Group
Gene OntologyGO:00082701.9e-18zinc ion binding
KEGG pathwaytca:6628543e-129 
 K04706 (PIAS)maps-> Small cell lung cancer
    Pathways in cancer
    Ubiquitin mediated proteolysis
    Jak-STAT signaling pathway
InterPro domain[303-351] IPR0041811.9e-18Zinc finger, MIZ-type
Orthology groupMCL10734 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214325-TA
ATGAGTAGATATTACGATTACGGTAACGCGATATCATGGAACAGCCAAGCGTTGTCTTCGGGCGGGCCCGGCAGCGCTCAGGGCAAGGATCCCCTGGCCCCGGCGATATCGGGTTCGAGAGCCATGTACCAACATCCGGCCGTCAGCGGATACAATCCACAAGACTCGCGATCACAGACCATGCCCGGCACGGCGCGGCAGAACCCTTTGTACGCGGGCAGTATGTACCACTACAGCGGCTCGGGCGCCACGCTGGCCCCCATGCCGTCCCCGTCACCCACCGCGCCACTGCCGCCCTTCCCAGTGCATCCTGACGTCAAGTTTAAGAAATTACCTTTCTACGATGTGTTGGCGGAACTGATGAAGCCGTCTACAATGATGCCGATGCAAGCTGGTCGCATGCAGGAGGGCACCTTCATCTTCCACCTCACGCCGCAGCAGGCCACTGAGATAGCATCAGGGAAAGACATCGTTGGCACCAGCAATAAACTCGATTATGTTATACAGGCGCAATTGAGGTTCTGCCTTCTAGAAACTTCATGCGAACAAGAGGATTATTTCCCACCTAGTGTTAATGTTAAAGTTAATAATAAGATGTGTCCGTTACCAAACCCAGTACCTACTAACAAACCTACTCCGGAGCCAAAGCGGCCGCCGCGGCCTGTGAATATATCGTCTCTCGTGAAGTTGTCTCCTACAGTAGCCAACACTATCCACGTGACGTGGGCGGCTGATTTCACACGAGCGTACGTCCTCAGCGTGTTCATGGTCAGGAAGCTAACGTCAGCGGAGCTGTTGCAGAGGCTGAAGAATAAGGGAACCAAGAACCCAGACTACACGAGATCTTTGATTAAAGAGAAACTATCAGAGGACTACGACAGCGAGATAGCGACGACATCCCTCCGCGTGTCCTTAATGTGTCCGCTGGGAAAGATGCGTATGTCGTGTCCGTGTCGACCAGCGAACTGTCCACACCTGCAGTGCTTCGACGCCTCACTCTTCCTTCAGATGAACGAACGGAAACCCACGTGGCTCTGTCCCGTCTGTGACAGACCGGCGCCTTACGACTCTCTAGTCGTTGATGGGTACTTCCAAGAGGTTCTAACGTCGCCTCGCCTGGCCAGCGAGTGTAACGAGATCCAGCTTCACGCGGACGGCAGCTGGTCCGCCCACGCGCCCCCGCCCCGCGCCCCGCCGCCCGCCGCGCCCGCCGCCGAGCCCGTCACTCTCATATCCGACGACCTTGAAGTCATACCCGTGGATGGGAACAATTCAGCCAAACGCGCAGCTGTAGGAGACAGTCGTACTCCAAAAACTGCTGAAGTATTGGTCGATCTGACATCAGACTCGGAAGATGAACTGCCACTTAAGAGGAAAATACCACAACCGAAGAGCACCCCGCCAGCCTTAGATAACATTAAGACAGACGACAACTACACTACATCCAGTGCGGAGGCGGTGACATCTAGCGGCTACCGTTCCCCGGGCGGCGGAGCGGTCATATCCCTGGACAGCCCGTCTCCGCCCGCCCCGGCCTCCCCTCACACACACTCCCACGTTACCCACACGACACACACGACATTAGACGCGGTGTCGCCGAGATCGCTCTCTAGCGACATATGTACAAGTAACAACATTGAACGCGAGGAAAACGACTCCGCGCCAACACACTGGGCACCGTATGCTGACGCAGAGAGGGAAACACATGACACATACAGAAGGGGACACGTAGCGTTCCTCGCTTTGAGCAAGTTCCGCGGTCGTCCGCCGACTCGTTGCGGCGCTGACGTGCGATACGCTGAACTAGTGTTTCCACCGTGA

Protein sequence:

>DPOGS214325-PA
MSRYYDYGNAISWNSQALSSGGPGSAQGKDPLAPAISGSRAMYQHPAVSGYNPQDSRSQTMPGTARQNPLYAGSMYHYSGSGATLAPMPSPSPTAPLPPFPVHPDVKFKKLPFYDVLAELMKPSTMMPMQAGRMQEGTFIFHLTPQQATEIASGKDIVGTSNKLDYVIQAQLRFCLLETSCEQEDYFPPSVNVKVNNKMCPLPNPVPTNKPTPEPKRPPRPVNISSLVKLSPTVANTIHVTWAADFTRAYVLSVFMVRKLTSAELLQRLKNKGTKNPDYTRSLIKEKLSEDYDSEIATTSLRVSLMCPLGKMRMSCPCRPANCPHLQCFDASLFLQMNERKPTWLCPVCDRPAPYDSLVVDGYFQEVLTSPRLASECNEIQLHADGSWSAHAPPPRAPPPAAPAAEPVTLISDDLEVIPVDGNNSAKRAAVGDSRTPKTAEVLVDLTSDSEDELPLKRKIPQPKSTPPALDNIKTDDNYTTSSAEAVTSSGYRSPGGGAVISLDSPSPPAPASPHTHSHVTHTTHTTLDAVSPRSLSSDICTSNNIEREENDSAPTHWAPYADAERETHDTYRRGHVAFLALSKFRGRPPTRCGADVRYAELVFPP-