Monarch geneset OGS2.0

DPOGS206898
TranscriptDPOGS206898-TA3573 bp
ProteinDPOGS206898-PA1190 aa
Genomic positionDPSCF300001 - 1787073-1795398
RNAseq coverage51x (Rank: top 70%)
Annotation
HeliconiusHMEL0068560.061.88% 
BombyxBGIBMGA012855-TA7e-11560.93% 
DrosophilaCG2990-PB1e-18036.61% 
EBI UniRef50UniRef50_UPI00015B4EC20.038.35%UPI00015B4EC2 related cluster n=1 Tax=unknown RepID=UPI00015B4EC2
NCBI RefSeqXP_001606961.10.038.35%PREDICTED: similar to CG2990-PB [Nasonia vitripennis]
NCBI nr blastpgi|3228001760.038.97%hypothetical protein SINV_05223 [Solenopsis invicta]
NCBI nr blastxgi|3228001760.038.89%hypothetical protein SINV_05223 [Solenopsis invicta]
Group
Gene OntologyGO:00036773.9e-49DNA binding
GO:00055243.9e-49ATP binding
GO:00062603.9e-49DNA replication
GO:00040033.9e-49ATP-dependent DNA helicase activity
KEGG pathwaynvi:1001233350.0 
 K10742 (DNA2)maps-> DNA replication
InterPro domain[230-428] IPR0148083.9e-49DNA replication factor Dna2
Orthology groupMCL15267 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206898-TA
ATGCAAGAAAGTTCGAGTTCCAAAAAGAAATCAGCTCCAAATAAGCAGCAGACAATCCTAAATTTCTTTGGAAATGCCTCAGACCTTAATAAAACACGTAATCAAGAAGCCTGTTCGGAAGGAAAATCTAAGTCACTTAAAAGAAAAGCCCATTCACCCTCAAAAATTCACTTAGACTCAATTTCACATAAAGTAACAGGGAAATTTAGTTCCGATAATAATACTACTAAAAAGTTAAGGATGGACACAAATACCTTAAACGATTTGAAAGGCAAACATATAGAACATGATGCTATAAAAGTGAAGTCACCTAAGAAGAAAAATATTTTAAGCTGCAAAAGTGGCTTAAGTTTAAACAAGGAAAGTAATGTTGAAAATATAATAAAGCATTATGAGCTTGAACCAATGAATCAAAAGGAAAATATATGTAATAGAAAAATTGAAATAACTTGTAGTTCAACTGATTTGTCAAATGACCTAAATATATCACAAAATGATTTTAACTCACAACCATCACATGATGGTTCCAGCTCAAATGAACAGAGGATTAGTCGTGCTGAAATCGATGATATATTCATAGACGACTTTGATTTAGAGGATATCAAAGATAATTTGGATTTGACAACCATGCAGAGGTGTAAGATACTTGATATCATAAAACATCCGCAGTCATTTGAAATCATATTGAAAAATAATGTAAATAATAGAGGAACTGTTTTCATAGAGGGAAATTGGTTGAACACATACCTGCAAACCGGGGATATAGTTAGCATTCTGGCATCTCGAGACTCGTCTGGTCGGTTTAGAATAAATAATACGCAAGGTCTCCTTGTGTTGAGGCCAGACCATTTAATATCTTCAACAAGTGTCGTCTCTGGTGTGTTCTGTAGACGGAAAGCTGTGTTACAAGAAAGATGGAAGGGAATTGATTCTGCATCAGTTACTATGACAATTGGTACATTGGTTCACGAAATGGTACAGAGAGCTTTAACTCAGAAAATTTCAAATGTGCCACAAATTAAAGTTTTATGTGAAAATATAATTAAGGAATCCATTGAGATGTTGTATGATGCTGGGATAACGGAGGCAGATGTGAGAACAAACATGCAGGTGTATTTAGAACCGCTGGCGGAATTTATGCAGACGTATGTGGTTAATGATAAAATGATTGACTCAAAGAAATATCAGTGGAAAGGTAAAGTAGAGAAAGTTTTAGACATAGAAGAGAATGTTTGCTGTCCTCAGATGGGTTTGAAAGGTAAAATTGATGCAACACTCAAAGTCACGATTCACAATAGAAAAGACAAAACCACAACAGTACCATTGGAGTTAAAAAGCGGTCGAGCAAGTGTTTCCGCAGAGCATACAGGACAGCTAGTATTATATGGTATGATGATGAGTGTCCTCGAAGGGAAGGATCCGGCCAGGGGGGACCAGAGGGGCCTGTTGTTGTACCTAAAGGACAAGATAAACATCACTGAAGTTAATTGTGATTATCCGGAGAGACGAGACCTGGTGATGCTTCGTAATGAGCTCGTCCAACATCTAGCAGCAGGACCGAATGATACGTCTCAAGACGAGTTAACGGACATCGAAGATTTGGGGAAGTACCACCAAAGTTTGCCAGAACCGGTGAATCATCACTCTGCCTGCTCCAAATGTGCATATCTCACTCTGTGTTCATTACATTTGTGGCATACAAATGGAACTACAGTATCAAGTGGCCACCCTTTAAGCAAACTCAAAAGTACAGCCCTGGGACACCTATCCTCTGAACATATACAGTACTTCCTTAAATGGGCGAGCTTACTTCGTGTAGAGGAGAAAATGCAGCTGATGAATTCACCAATACACGCCCTGTGGACTGATAGCACTGATATTCGATCTAAACGCGGAAGTTGTGCTCCGAATTTAACTCTGAGTCGTGTTTCAAGTTCCAACGGCAGATACTTACATGTCTTCATTAGAAACGGAACAAAAACAGAGACGCTCAGTGGTACGCAGTATATTAAAGGACCTCAAGCAGGTGATTTTTCTATAGTGAGCATAGATAATAGACCCTGGGTTGCAGCCGGAGCGGTTACATTAAGCGATTCCAAAGAATTGCAAATATTATTGGACAGAGACCTATCACTTCGTCTGTCAAGTAACACTAAATACCACATCGACATCTACGAAAGTTACGCTACGACTGTACAAAACCTAACTAACCTCGGTCTGCTTATAGAAGACACCCACCAAGCTTCGAAATTAAGAAAATTAATAATAGATAAAGAAAGTCCAACGTTCACACAGAAGCTGCCGCCCCGTGTACACAAGCTGAGTAAGGAGTTACTCGTCACACTGAACGAGGAACAACAGGCAGCAGTACTGAGAGTGTTGGAGTGCGATGATTATGTTCTTTTGCAAGGGTTACCTGGCACTGGGAAGACTCAAACTCTGTGTGCACTGATTCAACTGCTGTGTTCTTTGGGTATGCGTGTGTTGGTCACCGCTCACACACATTCAGCCGTCGATACACTTCTTAGCAGACTACCTTCATCTCTAAAAGTACTGCGTGTTGGTACTTCGTCGCGAGTATCGGTTTCTGCGGCGGTCAAAGAGTGCACCACAGTGGAACAGTTGACCAACCTTTATAACTCAGTGCAAGTGGTAGGAGTCACATGTCTCGGAGCATCTCACGCGTTGCTGTCGAAGAATACATTCGACTTTTGTATTGTGGATGAAGCGACACAAGTACTACAAAGCACAGTCCTCCGTCCGTTGTTTGCTGCGAATAAATTCGTGTTAGTTGGAGACCCCGAACAACTACCGCCAGTCGTCAGGAGTAGAGATGCTAAGTTTCTAGGAATGGAGGTGAGCTTATTCCATAGCCTGATGAAGGATAGAACAACGTGCACTCTAAGCCTCCAATACAGAATGAACCAACCTTTAGCAGACCTCGCCAATAAAATTGCGTACAGCAATAGACTTAAATGTGCCAATGAAACTGTAGCAAAGGCAGCACTTAATATAAATAAAATGAAACTGTCAGAAAGCTCTTCGGATCAATGGATTACGACTGTGTGCAGTCCGGAACCCGAAGATGCGGCCGTTTTCCTTAATACAAAGATGGAATTATCTGAAGACGCTTCTAAGACTTTGAACAATAAAGACGAAGCTGTGGTTGTGCTAGCTGTGATTAAAATTTTAAAACAGGCGGGTATATCAGCGTCAGATATAGGGGTGATAGCTCCGTACCGTGATCAGGTCACTTTGCTGAAACGCGTCCTGGACGGTACACAAGTCGAAGCCAGTACTGTAGATCAGTTCCAAGGCAGAGATAAAAGTGTCATAATATACTCTTGTACAAGAAAAGATGATAATTTAAGAAAAGTCAAGGATAACGAAGTCCTGAACGACAAACGTCGTCTGGCGGTTAGCGTGACGCGTGCGAAACACAAGTTTATTGTGATCGGTAACATTCGCGCATTGAAAAGATACACTACGATAATGAAACTAGAAGAGGCTTGTAAGACTGTTGATTTGGATGAAGAGGTTGTTACAAACCTGAATAATAAATATGGGTTCTGA

Protein sequence:

>DPOGS206898-PA
MQESSSSKKKSAPNKQQTILNFFGNASDLNKTRNQEACSEGKSKSLKRKAHSPSKIHLDSISHKVTGKFSSDNNTTKKLRMDTNTLNDLKGKHIEHDAIKVKSPKKKNILSCKSGLSLNKESNVENIIKHYELEPMNQKENICNRKIEITCSSTDLSNDLNISQNDFNSQPSHDGSSSNEQRISRAEIDDIFIDDFDLEDIKDNLDLTTMQRCKILDIIKHPQSFEIILKNNVNNRGTVFIEGNWLNTYLQTGDIVSILASRDSSGRFRINNTQGLLVLRPDHLISSTSVVSGVFCRRKAVLQERWKGIDSASVTMTIGTLVHEMVQRALTQKISNVPQIKVLCENIIKESIEMLYDAGITEADVRTNMQVYLEPLAEFMQTYVVNDKMIDSKKYQWKGKVEKVLDIEENVCCPQMGLKGKIDATLKVTIHNRKDKTTTVPLELKSGRASVSAEHTGQLVLYGMMMSVLEGKDPARGDQRGLLLYLKDKINITEVNCDYPERRDLVMLRNELVQHLAAGPNDTSQDELTDIEDLGKYHQSLPEPVNHHSACSKCAYLTLCSLHLWHTNGTTVSSGHPLSKLKSTALGHLSSEHIQYFLKWASLLRVEEKMQLMNSPIHALWTDSTDIRSKRGSCAPNLTLSRVSSSNGRYLHVFIRNGTKTETLSGTQYIKGPQAGDFSIVSIDNRPWVAAGAVTLSDSKELQILLDRDLSLRLSSNTKYHIDIYESYATTVQNLTNLGLLIEDTHQASKLRKLIIDKESPTFTQKLPPRVHKLSKELLVTLNEEQQAAVLRVLECDDYVLLQGLPGTGKTQTLCALIQLLCSLGMRVLVTAHTHSAVDTLLSRLPSSLKVLRVGTSSRVSVSAAVKECTTVEQLTNLYNSVQVVGVTCLGASHALLSKNTFDFCIVDEATQVLQSTVLRPLFAANKFVLVGDPEQLPPVVRSRDAKFLGMEVSLFHSLMKDRTTCTLSLQYRMNQPLADLANKIAYSNRLKCANETVAKAALNINKMKLSESSSDQWITTVCSPEPEDAAVFLNTKMELSEDASKTLNNKDEAVVVLAVIKILKQAGISASDIGVIAPYRDQVTLLKRVLDGTQVEASTVDQFQGRDKSVIIYSCTRKDDNLRKVKDNEVLNDKRRLAVSVTRAKHKFIVIGNIRALKRYTTIMKLEEACKTVDLDEEVVTNLNNKYGF-