Monarch geneset OGS2.0

DPOGS200440
TranscriptDPOGS200440-TA1593 bp
ProteinDPOGS200440-PA530 aa
Genomic positionDPSCF300236 + 433964-435773
RNAseq coverage83x (Rank: top 64%)
Annotation
HeliconiusHMEL0116020.083.58% 
BombyxBGIBMGA008898-TA0.085.09% 
DrosophilaCG3238-PA0.061.51% 
EBI UniRef50UniRef50_B4MU310.060.04%GK23961 n=5 Tax=Endopterygota RepID=B4MU31_DROWI
NCBI RefSeqXP_002051875.10.060.53%GJ24695 [Drosophila virilis]
NCBI nr blastpgi|1953863660.060.53%GJ24695 [Drosophila virilis]
NCBI nr blastxgi|1953863667e-18060.08%GJ24695 [Drosophila virilis]
Group
KEGG pathway 
InterPro domain[89-387] IPR0102857.4e-65DNA helicase PIF1, ATP-dependent
Orthology groupMCL15102 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200440-TA
ATGACTGGCGATAAAGAGCAAGGCATTAATAAAACACCATCGAAGCAAAGTGTGAGAACGAAATTATTAAGTGGCAAGGCTAAGGCATTTGAAGATATAAGCCCAGTTACGGTAGCGGATATATGCCAGGCTAAAAGTAAGATCAGTAAGGCGACTACAACAACTCCGTCTCCACCTTCCAAGAAGAGAAAGTTTGATGAAGCAATCAAAGGACCAGCTCCAAAGAAACTGTACTCTCCTTCACCACTTTCAACCACAAGTTCATTAAATGAAGAACAAACACGAGTGTTGGAAGCTTGTCTTAGTGGAAAAAATATATTCTTTACTGGTTCTGCAGGCACAGGGAAGAGTTTCTTGTTGAAAAGAATAGTAGCAGCTTTACCACCAGACGTAACAATGGCCACTGCTTCCACTGGAGTTGCCGCTTGTCATATAGGTGGGACCACCCTTCATGCATTTGCTGGCATTGGAGATGGTAGTGGTACTGTAGAAAAGTTGTGTGAGCGAGCAATAAAATCACCACTAGTTGCACAAAAGTGGAGAAAATGCAAACATCTTATAATAGATGAAATATCTATGGTTGATGGATTGTATTTTGAGAAGTTAGAAGCAGTAGCAAGACATGTACGGAAGAATAGCAAACCCTTCGGAGGTATTCAACTGATTTTGTGTGGAGATTTTCTTCAGTTACCGCCTGTTGTTGATAAAAATAAATCGGCAAAAAGATTTTGTTTCCAAACATCTTGTTGGGATAAATGTATAAAATTGTGTTTTGAACTGAAACAGGTTCATCGACAGACTGATCAAGAATTTATATCTATCCTGAACAATATTAGGATAGGTCGTGTTACAAAGGAAATCAGTGATCGCCTTTTAAAAACAGCAGCCCAGAAGATTGAAAGTGATGGAATTCTTGCCACAAGATTATGTTCACATACAAATGATTCAAAATCAATAAACAATTCAAAATTACGAGACCTGGAAGGCGAAGAAAAAATATTCTCAGCTCAAGATAGTGATAACGCCAGCACACTGCTCGACATGCAAACTATTGCACCATCAAAATTAGTTTTAAAAGTCGGTGCACAGGTGATGTTACTGAAAAATATTAATGTTAATGCCGGTCTAGTGAACGGAGCTAGAGGTGTAGTCGTTAGATTCGACGAAGGACTTCCGGTTGTGAGATTTAAAAACAAAAAAGAATACACAACTCGTACCGAACGTTGGTATGTAAAAAATTCTAGTGGTTCATTGTTCTGTAGACGCCAAATACCTTTAAATTTGGCATGGGCATTCTCAATACATAAATCTCAAGGTCTAACATTGGACTGCGTGGAAATGTCCCTCTCTAAAATATTTGAGGCGGGTCAGGCTTATGTCGCACTTAGCCGAGCTCAAAGTTTAGATACACTGAGAGTGTTAGATTTCGATTCACGGCATGTTTGGGCTAACACTGATGTCTTAGAATTTTATCAAAGATTTAGACGACGTTTACAACAAATGGAAATAGTACCTCTTGGGAGACCATTATCAGATAAGACAAACAAAAAAGTCAAACTTAGGGAAATTTTAGCAAAACAATTGAATAAATAG

Protein sequence:

>DPOGS200440-PA
MTGDKEQGINKTPSKQSVRTKLLSGKAKAFEDISPVTVADICQAKSKISKATTTTPSPPSKKRKFDEAIKGPAPKKLYSPSPLSTTSSLNEEQTRVLEACLSGKNIFFTGSAGTGKSFLLKRIVAALPPDVTMATASTGVAACHIGGTTLHAFAGIGDGSGTVEKLCERAIKSPLVAQKWRKCKHLIIDEISMVDGLYFEKLEAVARHVRKNSKPFGGIQLILCGDFLQLPPVVDKNKSAKRFCFQTSCWDKCIKLCFELKQVHRQTDQEFISILNNIRIGRVTKEISDRLLKTAAQKIESDGILATRLCSHTNDSKSINNSKLRDLEGEEKIFSAQDSDNASTLLDMQTIAPSKLVLKVGAQVMLLKNINVNAGLVNGARGVVVRFDEGLPVVRFKNKKEYTTRTERWYVKNSSGSLFCRRQIPLNLAWAFSIHKSQGLTLDCVEMSLSKIFEAGQAYVALSRAQSLDTLRVLDFDSRHVWANTDVLEFYQRFRRRLQQMEIVPLGRPLSDKTNKKVKLREILAKQLNK-