Monarch geneset OGS2.0

DPOGS215077
TranscriptDPOGS215077-TA3936 bp
ProteinDPOGS215077-PA1311 aa
Genomic positionDPSCF300187 - 234261-241763
RNAseq coverage217x (Rank: top 45%)
Annotation
HeliconiusHMEL0105360.071.28% 
BombyxBGIBMGA007175-TA0.071.10% 
Drosophilabgcn-PA1e-8125.81% 
EBI UniRef50UniRef50_E2BZ930.050.31%Uncharacterized protein KIAA0564-like protein n=17 Tax=cellular organisms RepID=E2BZ93_HARSA
NCBI RefSeqXP_392558.20.050.48%PREDICTED: similar to YTH domain containing 2 [Apis mellifera]
NCBI nr blastpgi|3071848690.049.70%YTH domain-containing protein 2 [Camponotus floridanus]
NCBI nr blastxgi|3838607200.050.57%PREDICTED: probable ATP-dependent RNA helicase YTHDC2-like [Megachile rotundata]
Group
Gene OntologyGO:00043861.8e-20helicase activity
GO:00055244.8e-12ATP binding
GO:00036764.8e-12nucleic acid binding
GO:00080263.3e-09ATP-dependent helicase activity
KEGG pathway 
InterPro domain[162-361] IPR0140011e-22DEAD-like helicase
[707-800] IPR0075021.8e-20Helicase-associated domain
[837-959] IPR0117093e-14Domain of unknown function DUF1605
[419-478] IPR0206837.7e-14Ankyrin repeat-containing domain
[545-643] IPR0016504.8e-12Helicase, C-terminal
[1160-1287] IPR0072758.6e-12YTH domain
[36-82] IPR0013742.4e-09Single-stranded nucleic acid binding R3H
[170-324] IPR0115453.3e-09DNA/RNA helicase, DEAD/DEAH box type, N-terminal
Orthology groupMCL13209 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215077-TA
ATGAGTTCTAAAAAGGGTAAGCGTAATTTAAGCAGTAAGCACGCCCCAATAGCGGAATCGGTCAGTATAGCCTTAAAGATACAGCTGGATAAATTTCTAAGTGATGATAATGAAACAGAACTCAAGTTTCCATCGTTCCTATCAGCTCAAGAGCGAAGGTTTATCCATGAAACCGTTGCCAAATTAGGACTCAAGTCTAAATCGCGCGGGAAAGGGGTTAATCGCTACATAACAATATATAAAAGAATTGGCTCAACTATTGTACAAAACGATGCCAAATTGATACTTGACTCGAATATGAGATGTAGTATTACGGAGTTGTTCAATACATTTCCAATCACTAGCAAAGAGAAGGATGATTTAAGCTGTTGCCCTGAAAAAGAAAGAAGTCCTCAATTAGCGCACAAGTCTTTGGGACAGTTAAATAATGGGGTCGCTCAGATACCAAACACAACATATAACCCAGAGTTAGTGAAGTTCCGAGAGAAGCTACCGGTGTATGACCAGCGACAGGAGCTCCTAAATGCTATCCAAAATAATCAGGTCATAATAGTGGCGGGTGCTACAGGCTGTGGTAAGACTACACAGTTACCACAGTTGATCTTGGACTACTGTCAAGAGAATCATCAACCGGCGAGGATTTATTGCACACAGCCAAGGAGAATTTCAGCTGTGTCTGTCGCTGAAAGGGTGGCGTTTGAACGAATGGAGAAGATTGGTCAGTCCATTGGCTATCAAATACGCCTGGAGTCCAGAGTGAGCCCACGGACAGTACTCACATATTGCACCAATGGAGTGCTACTGAGAACATTGATGGCGGGAGATACGGCTCTAACAGGTGTCACTCACGTCCTGGTGGACGAAGTTCACGAGCGAGATAAGTTCAGTGACTTCCTTCTGATAGCTCTAAAGGACTCGCTCACGAGGATGAAGGATCTCAAGCTGTTACTCATGTCGGCTACCATGGACACACAAATATTCTCCAGGTCTATATGTAGCATATATAACCTACCTTATATAGATGAACGAAGAAGTTTCCAATATCTCGATTCTGCTCAAACGAAACACAAATTAAATCAGGAGGAGAAAGAAGTGAAGGATGAGAAGGAGGTGCCAATAGATCCGGTGTTGCAAGCTGATATGGATGAATTCCTTGACGAGTGTTTCAATGAAGGCAGTCTGGACTCGTTCAGTCAGGTGTTATACATGGTGATGTCTGAGGGGGTTCCCGCGTCGTTGCCGCACTCTCGCTGTGGAAGGACGGCCCTGGTTGCCGCCGCCACCCACGGCCTGTCGCATGTCGCGTTACAGCTGCTCAGGATAGGTGCTGATCCAACAGCAAAAGATAAGAGTGGCAAAACACCGTACGACCACGCCGTAGAGAACGGCCACCACGAGTGTGCCAAGATACTGTCGACGTTCAATCACGACGATGAGAACAAGAACAATCAAGAAGAAGACGACCCAGAAGATAACTTCCTGCTGGACGTCTACTACCACGCCTTCTCAGAGGAACTGATAGACCACGATCTCATGTTGGCGTTGGTGAAACATATACACCTGACCATGGCGAAGGGCAGCATCCTGATCTTCCTCCCGGGATACGACGATATAGTCACTCTCAGAGACCTCATACAAGGCTGCTCGGAGATGAACACCATGAAGTTTCAAATATTCACATTACACAGCAACATGCAAACTTTGGACCAGAAGAAGGTTTTCAATCCGCTGCCGAGCGCGAGGAAGATTATAATATCGACCAACATAGCGGAGACCTCTATAACCATTGACGACGTGGTCTACGTCATAGACTCGTGCAAAGTGAAGGAAAAGTATTACGAGTCCGATGGGGGGGTTTGCTCGTTGCAATGCGTATGGACTTCCAGAGCGTGTTGCCAACAACGGGCCGGCCGCGCGGGCAGGACTAAACCGGGCCTCTGCTACCACATGTGTTCCAAACGACGCTTCAAAACACTACCTCTGAACTCGATACCGGAAATACTAAGAGTTCCCCTCCAAGAGTTATGTCTCCACACAAAATTGTTGGCGCCTGGCAACACTCCGATAGCGGATTTCCTATCGAAGGCTCTCGAACCTCCGTCGTTTTTGGCTGTTCGTAACGCTGTGACGTTGCTCAAGACCATCGGCGCGCTCACTCCGATGGAAGATTTGACGGAGATCGGACAACACCTGTTGGATCTGACTGTGGAACCCAAACTGGGGAAAATGTTGCTGTACGCTTGCGTCATGAAATGCCTGGATCCCATATTGACGATCGTATGCAGTTTGGCCAACAAGGAACCGTTCCAGATCTCGCTCAACCCCGAGAACAGAAAAAAAGGCAGTTCGGCGAGGAAGGAGTTCGCAGCTGACAGTTACTCAGATCACATGGCGTTACTGCGGGCCTTCCAAGCGTGGCAGATGGCGCGAGCTAACGGCGCGGAGAAAGCTTTCTGTTCGAGAAATTTAATATGCGGCGCGACCATGGAGATGATCGTAGGCTACAGGTCACAGCTGTTGGCGCAACTCCGAGCTCTGGGTCTCGTCAAGGCTCGAGGCTCGGGCGATATAAAAGACGTGAATTTGAACTCGGAGAAGTGGCACGTTGTTAAAGCGGTTCTGGTGAGCGGACTCTATCCTTCCATAGCGAGGGTCGACAGAGACACGTCCACACTGAGGACGTCCAAGGAAGTGAAGGTGGCGTTCCATCCGAGCTCGACGCTGCATCGCGGCGGGGGAGTGTCGGGGTCGCAGAAGTCGGTCCAGAGCCTCCCGACCGACTGGGTGGTGTTCGAGGAGATATCTAGAGTTGGAAGGTTTTGCTTCATTCGCTGCAACACCCTCGTCACGCCGTTGACCGTCGCGCTGTTCGGGGGGCCGTTGAGAGTTTCGACCGCAGCGCTGTCACAGAGGGCGAATCCACCAGGGCTGTCCAGTGACTCGGACAGCGAAGTCGAAGAGAGCAATTCCTCTCCAGACACGGCCATCCTGACGCTGGACGACTGGATGGCGTTCACAGCGGACGCGTCGGACGCGATTAGCGTGTACTACCTGCGGCAGAAGCTGTGCGCCCTAGTCATACGCAGGATGTCCAACCCGGCGAAGCCCATGACACCGCTGGACGAACAGATACTCAACGCCGTGGTGCACGTGCTGGGAGCGGAGGAGAAAAACATAGGGCTCAATCAGCCGACCGGCATCGGACAGAGACCGAAACCGCTCACGTTGGATTCACCGAACTGGCGGATGAGGGTCGACGAGGAACAGTACCGGCACGAGAACAAGTACCACCCGCAGTACAGCCACGCGGAGCTCGCGAATAACTTCTACCAGAATAACTACGCGTACAGGAACGGCCAGAGGAGCGGGTGGAGGGAACCGGCCCAGGGCTTCTCGCCGCAGTACGGAGGACCGAAGAGTCCGATGTCACCGGCGAAAGCTATGGGAAGCATGGAAAAATACGTCGACAGCGAAGCAAACACGCGCTACTTCGTGGTGCGAGCCGACGAAGTGCATTCCGTAGAAGCGCTGCAAGCGAGCGCCGGCGGGTTGTTCAACCAGAACACCGCCAAGAAACTCGTGAAGATTAAACAGGAGGGTAGCCGCGTGGTTGTCTTCTTCTCATGTTCGGGTGCTTCCAAGTTCATCGGCGCTGCCACCATCACGGACGGCAGTGGAGCGACTACAGCCAGTGGTACCAACAACAACACGCCGACGCTGGAATGGCTGTCCACACAACACGTGCCGTATCACATGGTCCGTCACATCGGCAACTCTCTGACGGGCGGCGGACGTGTGTCGTCCTCCCGGGACGGGACGGAGCTGTGCGGGCCGGCGGGGCGGGCCCTGCTCGCCGCCCTCACCGCGCGACGGCGGCACGGGTACGGACACCACGCACACCCGCGACCCATACAGAAACACGCCGCCGACGGCTAG

Protein sequence:

>DPOGS215077-PA
MSSKKGKRNLSSKHAPIAESVSIALKIQLDKFLSDDNETELKFPSFLSAQERRFIHETVAKLGLKSKSRGKGVNRYITIYKRIGSTIVQNDAKLILDSNMRCSITELFNTFPITSKEKDDLSCCPEKERSPQLAHKSLGQLNNGVAQIPNTTYNPELVKFREKLPVYDQRQELLNAIQNNQVIIVAGATGCGKTTQLPQLILDYCQENHQPARIYCTQPRRISAVSVAERVAFERMEKIGQSIGYQIRLESRVSPRTVLTYCTNGVLLRTLMAGDTALTGVTHVLVDEVHERDKFSDFLLIALKDSLTRMKDLKLLLMSATMDTQIFSRSICSIYNLPYIDERRSFQYLDSAQTKHKLNQEEKEVKDEKEVPIDPVLQADMDEFLDECFNEGSLDSFSQVLYMVMSEGVPASLPHSRCGRTALVAAATHGLSHVALQLLRIGADPTAKDKSGKTPYDHAVENGHHECAKILSTFNHDDENKNNQEEDDPEDNFLLDVYYHAFSEELIDHDLMLALVKHIHLTMAKGSILIFLPGYDDIVTLRDLIQGCSEMNTMKFQIFTLHSNMQTLDQKKVFNPLPSARKIIISTNIAETSITIDDVVYVIDSCKVKEKYYESDGGVCSLQCVWTSRACCQQRAGRAGRTKPGLCYHMCSKRRFKTLPLNSIPEILRVPLQELCLHTKLLAPGNTPIADFLSKALEPPSFLAVRNAVTLLKTIGALTPMEDLTEIGQHLLDLTVEPKLGKMLLYACVMKCLDPILTIVCSLANKEPFQISLNPENRKKGSSARKEFAADSYSDHMALLRAFQAWQMARANGAEKAFCSRNLICGATMEMIVGYRSQLLAQLRALGLVKARGSGDIKDVNLNSEKWHVVKAVLVSGLYPSIARVDRDTSTLRTSKEVKVAFHPSSTLHRGGGVSGSQKSVQSLPTDWVVFEEISRVGRFCFIRCNTLVTPLTVALFGGPLRVSTAALSQRANPPGLSSDSDSEVEESNSSPDTAILTLDDWMAFTADASDAISVYYLRQKLCALVIRRMSNPAKPMTPLDEQILNAVVHVLGAEEKNIGLNQPTGIGQRPKPLTLDSPNWRMRVDEEQYRHENKYHPQYSHAELANNFYQNNYAYRNGQRSGWREPAQGFSPQYGGPKSPMSPAKAMGSMEKYVDSEANTRYFVVRADEVHSVEALQASAGGLFNQNTAKKLVKIKQEGSRVVVFFSCSGASKFIGAATITDGSGATTASGTNNNTPTLEWLSTQHVPYHMVRHIGNSLTGGGRVSSSRDGTELCGPAGRALLAALTARRRHGYGHHAHPRPIQKHAADG-