Monarch geneset OGS2.0

DPOGS201302
TranscriptDPOGS201302-TA2403 bp
ProteinDPOGS201302-PA800 aa
Genomic positionDPSCF300176 - 264744-275113
RNAseq coverage331x (Rank: top 35%)
Annotation
HeliconiusHMEL0172553e-14357.19% 
BombyxBGIBMGA003050-TA1e-8764.21% 
DrosophilaSpps-PA8e-5975.44% 
EBI UniRef50UniRef50_E1ZX105e-6945.98%Transcription factor Sp4 (Fragment) n=2 Tax=Camponotus floridanus RepID=E1ZX10_CAMFO
NCBI RefSeqXP_624316.21e-6752.48%PREDICTED: similar to Transcription factor Sp3 (SPR-2) [Apis mellifera]
NCBI nr blastpgi|3287846523e-6948.83%PREDICTED: hypothetical protein LOC551928 [Apis mellifera]
NCBI nr blastxgi|3407267162e-8733.58%PREDICTED: hypothetical protein LOC100650907 [Bombus terrestris]
Group
Gene OntologyGO:00036761.1e-17nucleic acid binding
KEGG pathway 
InterPro domain[678-709] IPR0130871.1e-17Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL20679 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201302-TA
ATGTCGTCTGACTCCCACAAAGTGACAGTTGAATATATCAGTGAGGACAGAACAGTGAAGGAGGGTAGCGGCACCCAGTCGCCCCTAGCATTACTGGCGGCCACGTGTAGCCGTTACGGCGCTGTCAGCATGGGGCCCGAACAGCAGCAGGAGAAGGATCAGCAACAACAGCAACCACAGCAGCAACAGCATTACACCCAACAACAACAAGTGCAGCAACAGCAGCAGCAGCAGCAACAACAGCAACAACAGCAGCAGCAGCAACAACAGCAGCAGCAACAACAACAACAACAGCAGCAGCAACAACAACAACAACAGCAGCAGCAGCAACAACAACATCAACAACAAGTGCAGCAGCAACAGCAACAACAACAGCAACAACAACAACAACAACAACAGGTCCAACAAGCTCAGCTAGTCCAGCAACAAGTACAGGGCGAGGCACTTGCAGCACTCCAGCAGCAGTATATGCAGCAGCCGCAGCTCAGAGTCATCAGCACTGCCGTGGTGCAACAACTGAGAGCGCAAGGCCTGTCGGTGAGTGATGTGGGGAGAGATGTAAATGCTAGTGTCCAGCCGCTTCAACATGGCAATGAGTTATCAGCGGCCATTGTGAATGCTGCGACAACAGTCCCAAATGGCATTCAAGTGCAGCAACCGCAGGTGATATCAATGCAGCAGCTGCAATCTCTGCTGGGAGGCGGCGTGGTGTCGTCCAGTGAGGGCACGCCCTATCAGAACTCACCGCAACAACTGCTGCAGATACACCCACAACTGCTGCAACAACAGTGGCCGCCCCACATGCCGCCGACTGTGGTGGGAGGCGTCACTCCGCTGCAAGCCGTCACCGTGGACGGACAGGACGCTCTGTTCATACCCTCACACCACGCTCAGAACTTCTCGGGCATGGGTCAGGTGAGCCTGGTCAACGGTCAGCTGGTCCGCACGCCTGTCCTGCCGTCCGGGTTTCTCCACAACGTGATGCATCTCCCTGCGGAGCAACAAGCGACCGTGTCCATCCCCGGCACTAACCTAACAATACCGCTAAGCGCACTAACCGGTAATCAGATGGTAACTATCCCGGGAACGAATATATCGATACCGGGCGGGATACAGATACCGACCAGCCAGGCTATCACTATCCCGAGCTCCACGGGCGTGCAGTTGCCCGCGGCCGGCGTGCCCAATAACGGAGTAACGATACAGACTGACGGGAAACACGGAAACGGAAAAGAGGCCAAGTCACCCGCCAGTCCGGGGCAAGGCGGCGGGGTTGCTGTCCGTGGTGGCGTGGGCGGGGTGGGCGGCGTGGGCGGCATGGGCATGGTGCCGGTCCAGGTGCCCGTGAGCGTGGCCAACGGTCACACGGTGTACCAGACGGTCCACGTCCCAGTACACGCGCCCCACCTCCAGATAATACCGCAGCTGCAGCAGATGCAAGCCCAGCCTCAAGTGGCGAACGTGCTGACACCCTCCGGACAAATACAACAGATACAGATCGCGTCGCTCGGAAATGTTCAGGCTGTGTCTAATCCCATGCAGGACACTGGCCAGGTCCAGCAGCAGGCTATCATCACGAGTACTCCGAACGGACAGCAGGTCACTGTGATACCTGCCTCCAGTAATGTCCCGACGCTGGGCGACCCGCCAGACCCGCCGCACGTCTTGGTCCCAGCCGTGGGTTTGCCGGGCGTCCAGCTAGCGCAGATACCACAACAACAACCGCAACCCGCGCCTACACCGCAACCGCAACCTCTCATCGGTCAGCAAATACAGCAGGATCCGAACGAGCCGGGGAAGTGGCAGGTGGTGACGGTCAGCTCCGGCAACACAACTACCAGCGAGTGTGAGGCCAACAAGAACAGACCCAGCAGTCCCAACAGTGGGAAACGGTTGATGAAGCGAGTCGCCTGCACATGTCCCAACTGTGATCAGGGAGAGAACCGCCTGGTGGATCGCAAGAAACAACATGTCTGCCACATCCCGGGCTGTAACAAGGTCTACGGGAAGACCTCCCACCTAAGAGCACATCTACGATGGCATTCCGGAGAGAGACCCTTCCTTTGCAACTGGCTGTTCTGCGGGAAAAGGTTCACACGCTCGGACGAGCTGCAGCGTCACCGCCGCACACACACGGGAGAGAAACGCTTCGAGTGTCCGGAGTGCAGCAAGCGGTTCATGAGATCAGACCATCTCGCCAAACACGTCCGCATACACACCAAGAACAGGATCACGGAGGTGGCGACATCTACACAGTCAATGTATTCAGACTCCGGGGACGACAGCTGCGATGAGAAAATGATGTTGACCATAGAGACCATGCATCCCAGCAATGATGCGGAGGAGAAGCTGGTCATGATACGCTCAGGAGCCAAGCTCGAGCCGGACCACATAGACAGTTAG

Protein sequence:

>DPOGS201302-PA
MSSDSHKVTVEYISEDRTVKEGSGTQSPLALLAATCSRYGAVSMGPEQQQEKDQQQQQPQQQQHYTQQQQVQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQHQQQVQQQQQQQQQQQQQQQQVQQAQLVQQQVQGEALAALQQQYMQQPQLRVISTAVVQQLRAQGLSVSDVGRDVNASVQPLQHGNELSAAIVNAATTVPNGIQVQQPQVISMQQLQSLLGGGVVSSSEGTPYQNSPQQLLQIHPQLLQQQWPPHMPPTVVGGVTPLQAVTVDGQDALFIPSHHAQNFSGMGQVSLVNGQLVRTPVLPSGFLHNVMHLPAEQQATVSIPGTNLTIPLSALTGNQMVTIPGTNISIPGGIQIPTSQAITIPSSTGVQLPAAGVPNNGVTIQTDGKHGNGKEAKSPASPGQGGGVAVRGGVGGVGGVGGMGMVPVQVPVSVANGHTVYQTVHVPVHAPHLQIIPQLQQMQAQPQVANVLTPSGQIQQIQIASLGNVQAVSNPMQDTGQVQQQAIITSTPNGQQVTVIPASSNVPTLGDPPDPPHVLVPAVGLPGVQLAQIPQQQPQPAPTPQPQPLIGQQIQQDPNEPGKWQVVTVSSGNTTTSECEANKNRPSSPNSGKRLMKRVACTCPNCDQGENRLVDRKKQHVCHIPGCNKVYGKTSHLRAHLRWHSGERPFLCNWLFCGKRFTRSDELQRHRRTHTGEKRFECPECSKRFMRSDHLAKHVRIHTKNRITEVATSTQSMYSDSGDDSCDEKMMLTIETMHPSNDAEEKLVMIRSGAKLEPDHIDS-