Monarch geneset OGS2.0

DPOGS206415
Transcript: DPOGS206415-TA (3030 bp)
Protein: DPOGS206415-PA (1009 aa)
Genomic position: DPSCF300181 - 14898-30371
RNAseq coverage: 38x (Rank: top 73%)
Annotation
Heliconius: HMEL007597    0.0    63.36%
Bombyx: BGIBMGA013851-TA    0.0    57.42%
Drosophila: mle-PA    3e-87    38.22%
EBI UniRef50: UniRef50_UPI000224708C    7e-118    37.24%    UPI000224708C related cluster n=1 Tax=unknown RepID=UPI000224708C
NCBI RefSeq: XP_001600929.1    9e-119    37.24%    PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastp: gi|345489731    2e-117    37.24%    PREDICTED: putative ATP-dependent RNA helicase DHX30-like isoform 2 [Nasonia vitripennis]
NCBI nr blastx: gi|345489731    3e-116    37.48%    PREDICTED: putative ATP-dependent RNA helicase DHX30-like isoform 2 [Nasonia vitripennis]
Group
Gene Ontology: GO:0005524    2.7e-18    ATP binding
               GO:0004386    2.7e-18    helicase activity
               GO:0003676    2.7e-18    nucleic acid binding
               GO:0008026    8.6e-06    ATP-dependent helicase activity
KEGG pathway 
InterPro domain: [317-502]    IPR014001    5.2e-19    DEAD-like helicase
                 [562-665]    IPR001650    2.7e-18    Helicase, C-terminal
                 [335-477]    IPR011545    8.6e-06    DNA/RNA helicase, DEAD/DEAH box type, N-terminal
Orthology group: MCL17432 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206415-TA
ATGTTTGTACGAAGATATTTCAAATCAGTAAATCATAGTCAATTACTCTCTGAGTATAATTCCTATAGAAGTGGATACAAAGGAATTAAATTATGCCAAGAAAGTAAATTGGTAAATTACAGTTCCAATGCACAAATTAAAGAATGTTTATTCCAAAATTATAGCATAGGAACATTATATTCAAAAAGATATTACTCAAAGAAATTTATTGAAGAACTATTTCATGAACAAAACAGTGAAAATGAAAAAAAATTTTCAAAAGACCTATTCAGCAATCCACGAGCTACACTCAACGAACTCGCATCTAAGGTCCCAGAGAAAATATTTGACATACATTTCAAACAAACTATTGTCGCACCTAAAGGTATCAAGAAAAAACCCATACAAAATGACTGGATATGCACATACACATTTATTTGGCCAGAAAAGATGAAATTTGAAAGTGCAGCGATATCTAAACGACAGGCAGCTGACAAATCGGCCACACAAGCATTACATTGGCTTTATAATAATAAACGTATAGATATCAACGGTAAACCTATTTACAACGAGAACACGCTCAAGGAATTACAGAGCACATTAAATAATCCGTTAAACGCATCAATAAGTGAGAATTCGTTAGAACGTATCACAAGGATTTGGGAAGATTATGAAAGAGATATAAGTAAGTATTTATATGAACAAAAATTAATAAAACACATCTATGAAAGGACATTTGATGAAGCCAAACAAGCTTTGAATGTTACGACGATTCACAAAGATTCAACTTTAGATGAGACAGATTGTTCTGAAGACGTGTCAGAACAGGAGAACATAGCGGATGAACTAACAGATACAAGAACAAACATACATCCAGTTTTCGGGAAACCGGTGAAGCCCACGGCACAAGCGTTAGCGAGACGCGAGCGAACACTGAGACACACATTCAAAAATTACGACGAAGAGTTAACACCGCTACCTATAGACGAGTATTCCAATGACATAACATCAGCGTTGGATGACAGTCGCGTGTTAGTAATAATTGGCGCTGCGGGGTGTGGAAAATCGACTAGAGCACCCGTCGCAGTCCTAAGACAGCTCTGCGACAAAATGAACGCAATCGTGTCGCAGCCGCGACGTGTCGCAGCTATCGGGCTTGCGCAACGCGTGTCTGACGAGTTAGGCGAAAAGGTCGGTGAAACTGTTGGTTATCAAGTCCGTTTGCAGTCTGTGCCACCCAGACCTCCCGGCGGCGCCATCTTGTATTGCACTTCCGGTGTCTTATTAAAGAGGTTGCAGATGAATCCAGGTCTTGAAGGTTGTACCCACGTGTTCATAGACGAAGCACACGAGAGAGATGTTAATACAGATATAACGCTGTTGCTACTGAGACGGGCCTTGGACATAAATCGGCACCTGAAGGTGATCGTCATGAGCGCCACCCTCGATACAGGAGTCTTTACTAGATACTTCGACGACTGTCCGGTCATCCAGGTCCCCGGGAGAACATTCCCGGTTGAAATTTCGCATTTACCAGATATAGAGAAAAGATTCAATATAAGACTACCCTCAAGCTTGGAGAGCTGCAGAAAAGTTGGAAAGCCACAGATCAATTGCCAAGAAATAGTCCAAGTCATTAAATCCATAGACAATACTTGTCCCGAGGGCGCCATTCTAGTGTTCCTCCCCGGCTGGGCCGAAATCAAGCAAACTCAGCAGCTATTACAGGACCAGTACAAGGATTCGCCTCTACACATGATATTCCCGGTACATTCAAGGCTATCAACATCAGAACAAACGAAGATATTCTCAAAGTGCCTCGGTATCCGCAAGATAGTACTAGCCACTAACATAGCCGAGACATCTATAACAATACCTGACGTGGTTCACGTCATAGACAGCGGGATACACAGGGAGAATAGACTGCGAGATACTACTAATATCAGTAGCTTGGAAACAGTTTGGGCGTCTAAAGCTAGCTGTACACAGAGAGCGGGGCGAGCGGGGCGTGTTAAACCCGG
TCATTGTTACAAAATGTATACCAAAGAAAAGGAAGAAGAATTCCAAGCTCACACTACTCCGGAGATATTGAGAGTCCCTTTAGAACAAACTGTATTGGATTGTAAAACCTATGCCCCAGATGATAGAGTCGAAGATTTCTTATCTCAACTCCCGGAACCGCCGAGCGATAAGGCGGTTCGATTTGCGGTCAATGACCTCGTGGATTTGGGTGCGCTCACCCATAACCAAAAATTGACTCGCCTGGGCGCAATACTATCAAGGGTCAGCATACACCCGCGTTTGTGTTTCAGCGTTTTAAACGCTGCGTTTATTGGAAATATAATAGCGGGCGTGCGGACCGCTCTCGCCACCGAACAAGAGTTCTTCGAAGACTCCGGAGATAGGAGGAACGGTCTTGAAGGTTGTACCCACGTGTTCATAGACGAAGCACACGAGAGAGATGTTAATACAGATATAACGCTGTTGCTACTGAGACGGGCCTTGGACATAAATCGGCACCTGAAGGTGATCGTCATGAGCGCCACCCTCGATACAGGAGTCTTTACTAGATACTTCGACGACTGTCCGGTCATCCAGGTCCCCGGGAGAACATTCCCGGTTGAAATTTCGCATTTACCAGATATAGAGAAAAGATTCAATATAAGACTACCCTCAAGCTTGGAGAGCTGCAGAAAAGTTGGAAAGCCACAGATCAATTGCCAAGAAATAGTCCAAGTCATTAAATCCATAGACAATACTTGTCCCGAGGGCGCCATTCTAGTGTTCCTCCCCGGCTGGGCCGAAATCAAGCAAACTCAGCAGCTATTACAGGACCAGTTCAAGGATTCGCCTCTACACATGATATTGCCGGTACATTCAAGGCTATCAACATCAGAACAAACGAAGATATTCTCAAAGTGCCTCGGTATCCGCAAGATAGTACTAGCCACTAACATAGCCGAGACATCTATAACAATACCTGACGTGGTTCACGTCATAGACAGCGGGATACACAGGGAGAATAGACTGCGAGATACTACTAGTGAATAA

Protein sequence:

>DPOGS206415-PA
MFVRRYFKSVNHSQLLSEYNSYRSGYKGIKLCQESKLVNYSSNAQIKECLFQNYSIGTLYSKRYYSKKFIEELFHEQNSENEKKFSKDLFSNPRATLNELASKVPEKIFDIHFKQTIVAPKGIKKKPIQNDWICTYTFIWPEKMKFESAAISKRQAADKSATQALHWLYNNKRIDINGKPIYNENTLKELQSTLNNPLNASISENSLERITRIWEDYERDISKYLYEQKLIKHIYERTFDEAKQALNVTTIHKDSTLDETDCSEDVSEQENIADELTDTRTNIHPVFGKPVKPTAQALARRERTLRHTFKNYDEELTPLPIDEYSNDITSALDDSRVLVIIGAAGCGKSTRAPVAVLRQLCDKMNAIVSQPRRVAAIGLAQRVSDELGEKVGETVGYQVRLQSVPPRPPGGAILYCTSGVLLKRLQMNPGLEGCTHVFIDEAHERDVNTDITLLLLRRALDINRHLKVIVMSATLDTGVFTRYFDDCPVIQVPGRTFPVEISHLPDIEKRFNIRLPSSLESCRKVGKPQINCQEIVQVIKSIDNTCPEGAILVFLPGWAEIKQTQQLLQDQYKDSPLHMIFPVHSRLSTSEQTKIFSKCLGIRKIVLATNIAETSITIPDVVHVIDSGIHRENRLRDTTNISSLETVWASKASCTQRAGRAGRVKPGHCYKMYTKEKEEEFQAHTTPEILRVPLEQTVLDCKTYAPDDRVEDFLSQLPEPPSDKAVRFAVNDLVDLGALTHNQKLTRLGAILSRVSIHPRLCFSVLNAAFIGNIIAGVRTALATEQEFFEDSGDRRNGLEGCTHVFIDEAHERDVNTDITLLLLRRALDINRHLKVIVMSATLDTGVFTRYFDDCPVIQVPGRTFPVEISHLPDIEKRFNIRLPSSLESCRKVGKPQINCQEIVQVIKSIDNTCPEGAILVFLPGWAEIKQTQQLLQDQFKDSPLHMILPVHSRLSTSEQTKIFSKCLGIRKIVLATNIAETSITIPDVVHVIDSGIHRENRLRDTTSE-
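
Consistency check (illustrative, not part of the original record): the 3030 bp transcript corresponds to 3030 / 3 = 1010 codons, i.e. the 1009 residues listed for the protein plus one stop codon, which the protein sequence writes as a trailing "-". The minimal Python sketch below shows how that relation could be verified. It assumes the standard genetic code and that the transcript is a complete coding sequence in frame 0; the function names `expected_residues` and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def expected_residues(transcript_bp: int) -> int:
    """Residue count for a CDS of the given length, excluding the stop codon."""
    if transcript_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return transcript_bp // 3 - 1


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    print(expected_residues(3030))                # residues implied by 3030 bp
    print(translate("ATGTTTGTACGAAGATATTTCAAA"))  # first 8 codons of the transcript
    print(translate("GATACTACTAGTGAATAA"))        # last 6 codons, including the stop
```

To check the full record, the two lines of the nucleotide sequence could be joined and passed to `translate`, and the result compared with the DPOGS206415-PA sequence.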