Monarch geneset OGS2.0

DPOGS206806
TranscriptDPOGS206806-TA2718 bp
ProteinDPOGS206806-PA905 aa
Genomic positionDPSCF300001 - 4124608-4147483
RNAseq coverage1205x (Rank: top 10%)
Annotation
HeliconiusHMEL0105530.082.45% 
BombyxBGIBMGA000629-TA0.080.78% 
DrosophilaUnr-PB0.064.76% 
EBI UniRef50UniRef50_E2B5T40.058.45%Cold shock domain-containing protein E1 n=4 Tax=Formicidae RepID=E2B5T4_HARSA
NCBI RefSeqXP_967163.10.060.22%PREDICTED: similar to AGAP004937-PA [Tribolium castaneum]
NCBI nr blastpgi|910799150.060.22%PREDICTED: similar to AGAP004937-PA [Tribolium castaneum]
NCBI nr blastxgi|910799150.060.57%PREDICTED: similar to AGAP004937-PA [Tribolium castaneum]
Group
Gene OntologyGO:00036776.1e-13DNA binding
GO:00063556.1e-13regulation of transcription, DNA-dependent
GO:00036761.9e-10nucleic acid binding
KEGG pathwayoaa:1000746302e-07 
 K01490 (E3.5.4.6, AMPD)maps-> Purine metabolism
InterPro domain[105-176] IPR0160278.6e-19Nucleic acid-binding, OB-fold-like
[261-319] IPR0020596.1e-13Cold-shock protein, DNA-binding
[756-820] IPR0111291.9e-10Cold shock protein
Orthology groupMCL14397 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206806-TA
ATGTCAAGCAATCCTCAGTGGAAAATGTTCCAGCCGCCAGATTCACACAACTCCCCTTTACAGGATATTTTAGATCTCCAACAATCAAATATGCAGGCCAAACAGGTTTATCGTAATGGTACTTACAGCGGTGAGTACCCAAGTTCCCCTCGAGATAATTTTAATAATATGGGCAAGTCAAGTGGTGGAAATAGATTTTCTGTTGGCAATTATAATTTGGATGGATGTCCCATGGGAGATTCTGTGGGAGTGTATTCTTCCAGTTCAACCAATAACCAATCTCAGTATGATTCCACGAATCACCCACCTATAAGAGAAACAGGGATTATTGAAAAATTGCTTCATTCCTATGGCTTTATACAATGCTGCGAGAGGCAAGCTCGTCTATTCTTCCACTTTAGCCAGTTTGGTGGTAATATAGACCATCTGAAAATAGGTGACCCAGTAGAATTCGAGATGACTTACGACCGTCGTACTGGCAAACCAATTGCGTCTACAGTAACTAAAATAGCCCCCGAGGTGGTACTGAGTGAGGCTCGAGTTACCGGAGTAGTGACTACGGAGGTGAAGGGAGAGGGTTCAGGGGATAACACCGGACGCATATCGTATGAGAACCGCGGCGAATGCTTTTTCTTGCCGTATACTAAAGATGACGTCGAAGGAAACGTAACCCTTCGTACAGGAGACGCTGTCAGCTTCCAAATCGCAACCAACCAAAGAGGCACTCTTGGAGCGTGTCACGTGCGTTTCGAGAACCCCGCGCATCCCGTCAAGTATCATGGCGTAGTGTGTTCTAAGAAGGAAAATTTTGGTTTCATCGAGCGGGCTGATGTCGTTAAGGAGATATTCTTCCATTACTCTGAAGCCAAGGTCAAAGAGGAGCTGTCTCTAGGAGATGACGTGGAGTTTATAATACAGACTAGGAATGGTAAGGAGGTTGCGTGCAACATCAGTAAGCTGCCGAGTGGGTCGGTTGTGTTCGAAGACGTTAGCCCGGAGCAGCTTCGGGGGCAGGTGCTGAAGCCGCTGGAGCGTGGGGCGCGACTCCAGAGCGACCCTCTACCAGGCAGGATCAGATACAGGGCACCGGACCATTCTGAAGTTGAGGTGCCTTTCGGCGACAAAGACCAGTGCGGGGAGTTCACCTTGCGGCACGGGGATTGGGTCCAGTTCCAAGTGGCAACGGACAGACGGGATCAACTGAAACGGGCAACCAATATATCGCTGCTGGATGAGTCATTCAACGTATCAGGCGAAAGACGTGAACAGGGCATTGTTTGTTCCTTGAGAGATGGCTTCGGTTTCATCCGATGCGTAGAGCGTGAGCAGACCATGTTCTTCCACTTCGCCGAGGTGTTGCGTCTCGGCCAGGAGTTAAGTGTTGGCGACGAGGTTGAGTTCACCGTGGATCCTTTGTCATCCTTCTCCAACATGAACAGCCGCCAGTCAGCCATCCGTATCCAGCACTTGTCTGCTGGTTCCGTACAGTTCGAGTCCCTTGTGGAGCGAGGCGTCCGCGGGGTCGTCACCAAGGAAGCGCATTATTCCAACGAGAGTCCCAACAGAAACTCACCGAATGAGTCCGGTATCATAACGTGTCAAATAAACAACTTGAAGAAATCTATACCATACACGGTCAAGAAATGTGAATCAAAGATGCTGCCTCGCGTGGGCGACAAAGTGACCTTCGATCTGTATCAGGTGAAGAGGACTAAGGAGCTGGTTGCTATGAGTGTGACGATGCAGCACAGCATGACGAACGGCCGGATTGGAGGGGCGGGGGCTGGAGGGGGATCGGGGGCGGCCACCCAGCAGGGCTTCGTGGCCGCACTCAAGGATGGCTTCGGTTTCATTGAGACAGCCGATCACACCAGGGAGGTGTTCTTTCATTTCAGTAATCTGGAAGGCAGCCCGGATGTATTGGAGTTGGGTTCAGAGGTGGAGTACACTGTGGGTCGTCAGAGCAGCGCTAGCGGTGGTTGTGCCAGCGCCGAACATGTGCGACCACTGCCTCGTGGAACTGTGCCCATTGCACGACCCCTGGAACCTCCTCTCACCGGCACGGTGACACGTACGCTGAGGGCCCTCAACCCCGACCAGGCCAAGTACTCTGGCTTAATCCAAGTGGAGGGTGGAATGACCTACGAATTTGGCATCATGGGACTAGCGAGCAAGCGAGAGATTCTGCAAGTTGGCGATCCGGTCACATTTCAGTCGGACATGGAAGGTCGCGCTACTAACATAGTGCCCATTAGAAAGAAGAGACGGGCCACCGTAGACGCTATAAAAGGCGGCTTCGGTTTCCTGTCTCTCGAGGCTGAGGAAGGTCGCCGTCTCTTCTTCCACATGAGTGAGGTCCGCGGGAACCCTTCAGATCTGGCGCCCGGGGACGCCGTCGAGTTTGTGATGCTAACCAACCCCAGGAACGGGAAGTGCTCAGCCTGCAATGTCGTCAAAGTCGGGAGCAACAGCAGCAATAGCAAGATATCAAAGCGCGATCGCGAGAGGGAGCGTGAGAGAGAGCGTCCAGAGAGGCCGGAGCGTCTGCTGGCCAGGCTGAGGACGGTGTCACTGGAGGAGCCCGGGCCGCGCGTGCTGGTGTTGCGACCCCCGCGGGGACCGGACGGCAGCCTCGATACCAGCCGCACCACCACCCGCCGCAGAGTCTACAGGTTCGGTAAGCAGCCCCCGCCGCCGCCGCCGCCGGGCCGTCCGTGA

Protein sequence:

>DPOGS206806-PA
MSSNPQWKMFQPPDSHNSPLQDILDLQQSNMQAKQVYRNGTYSGEYPSSPRDNFNNMGKSSGGNRFSVGNYNLDGCPMGDSVGVYSSSSTNNQSQYDSTNHPPIRETGIIEKLLHSYGFIQCCERQARLFFHFSQFGGNIDHLKIGDPVEFEMTYDRRTGKPIASTVTKIAPEVVLSEARVTGVVTTEVKGEGSGDNTGRISYENRGECFFLPYTKDDVEGNVTLRTGDAVSFQIATNQRGTLGACHVRFENPAHPVKYHGVVCSKKENFGFIERADVVKEIFFHYSEAKVKEELSLGDDVEFIIQTRNGKEVACNISKLPSGSVVFEDVSPEQLRGQVLKPLERGARLQSDPLPGRIRYRAPDHSEVEVPFGDKDQCGEFTLRHGDWVQFQVATDRRDQLKRATNISLLDESFNVSGERREQGIVCSLRDGFGFIRCVEREQTMFFHFAEVLRLGQELSVGDEVEFTVDPLSSFSNMNSRQSAIRIQHLSAGSVQFESLVERGVRGVVTKEAHYSNESPNRNSPNESGIITCQINNLKKSIPYTVKKCESKMLPRVGDKVTFDLYQVKRTKELVAMSVTMQHSMTNGRIGGAGAGGGSGAATQQGFVAALKDGFGFIETADHTREVFFHFSNLEGSPDVLELGSEVEYTVGRQSSASGGCASAEHVRPLPRGTVPIARPLEPPLTGTVTRTLRALNPDQAKYSGLIQVEGGMTYEFGIMGLASKREILQVGDPVTFQSDMEGRATNIVPIRKKRRATVDAIKGGFGFLSLEAEEGRRLFFHMSEVRGNPSDLAPGDAVEFVMLTNPRNGKCSACNVVKVGSNSSNSKISKRDRERERERERPERPERLLARLRTVSLEEPGPRVLVLRPPRGPDGSLDTSRTTTRRRVYRFGKQPPPPPPPGRP-