Monarch geneset OGS2.0

DPOGS215362
Transcript         DPOGS215362-TA    1254 bp
Protein            DPOGS215362-PA    417 aa
Genomic position   DPSCF300351 + 27407-31747
RNAseq coverage    466x (Rank: top 27%)
Annotation
Heliconius         HMEL004577          3e-151   85.99%
Bombyx             BGIBMGA009565-TA    3e-161   84.56%
Drosophila         Ssdp-PC             6e-71    66.80%
EBI UniRef50       UniRef50_Q9BWG4     4e-71    51.55%   Single-stranded DNA-binding protein 4 n=156 Tax=Euteleostomi RepID=SSBP4_HUMAN
NCBI RefSeq        XP_973397.1         7e-122   66.37%   PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp     gi|383848616        1e-124   66.14%   PREDICTED: single-stranded DNA-binding protein 3-like isoform 3 [Megachile rotundata]
NCBI nr blastx     gi|328787002        1e-162   67.17%   PREDICTED: single-stranded DNA-binding protein 3 isoform 2 [Apis mellifera]
Group
Gene Ontology      GO:0003697    4.4e-155   single-stranded DNA binding
                   GO:0005634    1.1e-16    nucleus
                   GO:0003677    1.1e-16    DNA binding
KEGG pathway 
InterPro domain    [2-392]     IPR008116    4.4e-155   Sequence-specific single-strand DNA-binding protein
                   [81-364]    IPR007591    1.1e-16    Single-stranded DNA-binding protein, SSDP
Orthology group    MCL13888    Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215362-TA
ATGTATGCCAAGGGCAAAAGCTCTGCGGTACCTTCGGACGCTCAGGCACGGGAGAAGTTGGCCCTTTATGTGTATGAGTACTTACTGCACGTCGGGGCACAGAAAGCGGCGCAGACTTTCCTTTCTGAAATACGATGGGAAAAGAACATAACACTCGGCGAGCCGCCCGGATTCCTGCATTCCTGGTGGTGTGTTTTCTGGGACCTGTACTGCGCCGCGCCTGAAAGGAGGGACACATGCGAACACTCCTCTGAGGCTAAGGCATTCCATGACTATGGATTCGTCAATTCAGGTTATGGTGTTAACGGCATCGGTCACAACGCAGGCCCGGCGCCGCCTAATGACGGTATGGGTGGCGGAGGTATGCCACCAGGTTTCTTCCCCAACTCCTCACTCCGACCATCACCGCCAGCCCCACATCCTGGATCTCAGCCCTCACCGCATGGACCACAGCCACAGTTGATGGGGACAGGCCAGCCGTTCATAGGACCCTGGTACTCGGGAGGACCAAGAACAGCCGTCAGAATGGGCATGGGAAATGATTTTAATGGTCCTCCGGGTCAAGGCATGATGTCGAACTCCTTGGAGCGAGGCAGCGGTATGCTGGGCGGGCCGCGCATGACCCCGCCCCGCCCCGGCATGGGACCCATGAGCCCTGGTGCATATGCAGCCGGCATGCGTGGCCCACCGCCACAAGCCCCAGGTATGCCACCAATGGGTATGGGACCACGTGGCGCTTGGGCCGGCGGAAGTGGCGGCGCTGGTGGGGGATCCGCCCCCCTCAACTACAGCGGAGGCTCGCCCGGCGCGTACGGGGCGCCTCCCGGGTCCAATGGACCCCCAGGACCTCCGACTCCCATCATGCCAAGCCCACAGGACTCATCCAATTCGGGCGGTGACAACATGTACACATTGATGAAGCCGGTGGGCGCAGCCCTAGGGGCAGAGTTCCCGCTCGCCGGCGAGCACGGGCCCTCGTCGCAGCACCTACCTCAGCCTCCCACTTCCGAAGGGCTAGGCGGGGTGGACGGTATGAAGGCGTCCCCGGGCGGTGTCGGGGGCGGAGGCCCGGGGACTCCGAGAGAGGACTCCGGCTCTGGAATGGGGGATTACAATTTAAGTTTCGGCGGACCGGGCGGCGATCAGAACGACCAGACGGAGTCGGCGGCCATTCTCAAGATAAAGGAGAGCATGCAAGAGGAGGCGAAGAGATTCGAGAAGGATCCGGACCATCCAGATTACTTTATGCAGTGA

Protein sequence:

>DPOGS215362-PA
MYAKGKSSAVPSDAQAREKLALYVYEYLLHVGAQKAAQTFLSEIRWEKNITLGEPPGFLHSWWCVFWDLYCAAPERRDTCEHSSEAKAFHDYGFVNSGYGVNGIGHNAGPAPPNDGMGGGGMPPGFFPNSSLRPSPPAPHPGSQPSPHGPQPQLMGTGQPFIGPWYSGGPRTAVRMGMGNDFNGPPGQGMMSNSLERGSGMLGGPRMTPPRPGMGPMSPGAYAAGMRGPPPQAPGMPPMGMGPRGAWAGGSGGAGGGSAPLNYSGGSPGAYGAPPGSNGPPGPPTPIMPSPQDSSNSGGDNMYTLMKPVGAALGAEFPLAGEHGPSSQHLPQPPTSEGLGGVDGMKASPGGVGGGGPGTPREDSGSGMGDYNLSFGGPGGDQNDQTESAAILKIKESMQEEAKRFEKDPDHPDYFMQ-
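
Length check:

The lengths listed in this record can be cross-checked with a little arithmetic. The Python sketch below is illustrative only: the function names are hypothetical, and the genomic coordinates are assumed to be 1-based and inclusive, which the record does not state.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1254
PROTEIN_AA = 417
GENOMIC_START, GENOMIC_END = 27407, 31747


def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, plus an optional stop codon."""
    codons = protein_aa + (1 if has_stop else 0)
    return 3 * codons


def interval_span(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# 417 codons plus one stop codon should account for the whole transcript.
cds_matches = expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP

# Genomic footprint of the gene model under the coordinate assumption above.
genomic_bp = interval_span(GENOMIC_START, GENOMIC_END)

print("CDS length consistent with protein length:", cds_matches)
print("Genomic span (bp):", genomic_bp)
```

Under these assumptions the 417 aa protein plus a stop codon accounts for all 1254 bp of the transcript, which matches the sequence starting with ATG and ending with TGA. The genomic span is longer than the transcript, which would be consistent with introns, although the record does not list the exon structure.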