Monarch geneset OGS2.0

DPOGS212047
Transcript:        DPOGS212047-TA, 4749 bp
Protein:           DPOGS212047-PA, 1582 aa
Genomic position:  DPSCF300054 + 512722-526879
RNAseq coverage:   348x (Rank: top 34%)

Annotation
Heliconius       HMEL012975              0.0  69.11%
Bombyx           BGIBMGA010194-TA        0.0  77.42%
Drosophila       Ino80-PA                0.0  60.21%
EBI UniRef50     UniRef50_UPI00020641BB  0.0  56.45%  UPI00020641BB related cluster n=3 Tax=unknown RepID=UPI00020641BB
NCBI RefSeq      XP_393832.3             0.0  57.01%  PREDICTED: similar to CG31212-PA [Apis mellifera]
NCBI nr blastp   gi|380016117            0.0  56.48%  PREDICTED: LOW QUALITY PROTEIN: putative DNA helicase Ino80-like [Apis florea]
NCBI nr blastx   gi|328786874            0.0  56.88%  PREDICTED: putative DNA helicase Ino80-like [Apis mellifera]

Group

Gene Ontology
GO:0003677  1.3e-83  DNA binding
GO:0005524  1.3e-83  ATP binding
GO:0004386  2.2e-21  helicase activity
GO:0003676  2.2e-21  nucleic acid binding

KEGG pathway

InterPro domain
[470-771]    IPR000330  1.3e-83  SNF2-related
[463-661]    IPR014001  3.9e-34  DEAD-like helicase
[1119-1202]  IPR001650  2.2e-21  Helicase, C-terminal

Orthology group:   MCL13934 Single-copy universal gene

Nucleotide sequence:

>DPOGS212047-TA
ATGTCTGAAAAGTCTTTAGTGGGAAAAACGGCGTTAAAGCGTCATGAAGTTGCAAGACCTATACACATTCAAAGACTAGAGGCAGCTCTAGACATAAGACCCTTCGTAAATCAAGTGGAAGGGATACTGAATTCTGATGGGGACTCTGACAGTGAAGAAGATTCAGATGGCAAGTCTATGGCAAATGAAATGTCCAAAGGTGATAAGGACATGCATGGTGTTGTAACTACAAGACAAGAGAGGCAAAGCGATCAATTAAGACTATATAATTTGTCATCTGTGGGCGAGGAGAGGCAATGGTTAAGAGATGTCCTGCTGTCATCGGAATCGGAATCAAGTAGTGAAGATGATACACCAGCTGCTAAGGAAAGAAGGATAAAACTGTTACTTAAAGAGAGATATTTTCATAATAAATATGCAAAGAGTTACTATAAGGATTCGGGGAACTCCCGCTACATGTACTATGGTGCGGGGTTATTATCTACTATCGACCGTTATCCTGAGCGGAGGATCAGTAGGCCTTTGCCCCCCGCACGAGGGCGCCGTGGCCGACCGGCCGAGCGGGGCTCCTCTAGGGGCCTCCAACGAAACAGGCGCGGCCGCGCCAAAGAGGGCCGTACTGACGGTAGCGATGCTCCAGATGGCGTTGATTGGGAGGAACAACTTCGTAATCTCAAAGAGGAAAATGATACAGATGAATTTACCATATCAGAGAAACCTGGACCCAGAGGTCGCAAAAAGTTATCATCCTTGAAAAGCCCAGAAGCCTTAACGGCTAGACGGAGGCGACACTGGCAGCTTTTAGTTAAAAAGGAGCTTGGGAAAGTTCAGAGATCAAGGACCGCGACACACAGGGAACTCATGCTGCAAAGGAAAAGATTAGCTACACTTTGCTGCAAGCATTGGAGGCATGTTGCAATGCAATCCCAGAAGAATATGAAGGAGACGGTATGGCGGTGTAAGCGTCTAAGTCGGGAGATGCAGGCTTACTGGAGGAGGTATGACCGAGCTGAGAGAGAGACCAGGAGGAGGCTGGAGAGAGAGGCGGAGGAACAGAGAAAGATGGATGTGGAGCTGATGGAAGCGAAACGACAACGTCGCAAACTTAACTTCCTGATCACACAGACAGAACTGTACGCTCACTTCATGCAGCGGAAGTTGAACGCGACGGAAGACGTGGATGACGACACCGACCGGATATTGATGCAGTTGGACGAGGACAGAGATCCGAGGCTCTCGGCCATAGATAATTATGATAGTGAGGCAATGAAAGAGTTGGCGTCGCGGAACGCCCGCGAGGCGTTTCAAGCGGAGAGGGCTCGGACCTCCGCCCCGGAGGGAACAGACGAGAAAGAGAGACGGAGGGATCATGATCAACCAGAAATATTCAGGGGCACGTTGAAAGGATACCAGCTGAAAGGAATGAACTGGCTCGCTAACTTATATGACCAGGGTATAAGTGGTATATTAGCGGATGAAATGGGTCTCGGTAAAACTGTCCAATGCATAGCGTTTCTATGTCACGTGGCTGAGAGGCTAGGCGTGTGGGGGCCGTTCCTCGTCGTGTCGCCCGCATCAACATTACACAACTGGCAACAGGAGATGCAGAGATTCGTTCCGGATTTCAAAGTTGTTCCATACTGGGGCAGTCCGAGTGAGAGAAAGATCTTACGTCAGTTCTGGGAACGCAAAGACCTGCACACACCGCAGGCTGCTTTCCACGTGGTGGTGACGTCATACCAGATTGTTGTATCGGATTTGAAGTATCTGAACAGAGTGTCCTGGCAGTACATGATATTGGATGAGGCTCAGGCCATCAAGAGTTCAGCCAGCATGAGGTGGAAGCTGCTGCTAGGTTTCAGTTGCAGGAATAGGTTATTGCTATCGGGTACACCAATACAGAACAGTATGGCGGAACTGTGGGCGTTACTACATTTTATAATGCCTACGTTGTTCGACTCGCACGATGAGTTCAACGAGTGGTTTTCAAAGGACATTGA
GAGTCATGCAGAGAATAAAACGACCATAGATGAAAAACATCTATCACGACTGCACATGATTTTAAAACCTTTTATGTTGCGTCGGATAAAGAAGGACGTCGAAAATGAATTGTCTGATAAAATTGAGATTATGGTACACTGCCCTCTGACGATAAGACAGAAATTATTGTATATAGCTCTAAAGAAAAAAATTAAAATAGAAGAATTATTACATTACTCTGTCGGAGGAGAGTCGGGTCATAGTGTAGATAAGAATTTCACCTCGAATCTTATGAATTTGGTTATGCAATTTAGAAAGGTATGCAATCATCCGGAACTGTTTGAGAGAAGGGATGTCAGGTCGCCTTTCGCCATGCAAGTCGACGATTATCATTTACCAAAGTTATTAGCTGAAGAATGTATCCTAGTGAGGTCCATACCATCCAAGCGACATCTGCTATACAACAAGTTGAGTGTCCTCAACCCCGAGTACGTTCATCACAATACAGAGAGCTTCAGCTTCATGAGGTTCATGGATTTATCTCCCATGGACATGTATAGGATTATGCTGTGTGGATGGCTGTATACGATGTTACATATAGAAGAATGTTTAGATAAATACGAAAAGCTTCGCCACAGACAGTTCTGGTGGAGGCGGATAGAAGAGGCTGGTGTTAAAGAAGAACAGAGCACAGAAGATTTCGGCGACTATTTTGACAGAGTTCATTTAAAACACTTATTACTTATCAGTCCGAATGGAGATACACTCAAGAGAAGGAAAGAGGACTCTTTGTTATGGAGGGAATTTCTATTCACTGATCCTCTATGGGGAGAAGGAGGTCCGTTTTACATTCACACCGAGCAAACCATACATTACATGCCTGAAACTGTCGAACATCGAGGGCTACGGACGAGGAATATGAAGTGTGAGGCGCAGCCGGAAATGCTGAGCGAGATCAAGACGGAGGAAGGCGGTGTGGTCCCTGTGGAGCCCCGCGCGGCGGAGGCGGTGGAGTTCCCTCACACGGAGCGTGCGCCCCGCGTTATGGAGCACCTTCAGACCCACATTCCGGCCTTCCTGTGTACCGCACAGCAGAAGGCGAGTGCGCATTGTCGCGAAGCGTTCGCCAGCAGTCGTAGCTGGGCGCACTCCCAAGAGAGACATCGGAGAGGAGAAAATGACGAGGGAGCGGCGCTACTACGACGACTGGTGGACGCCGCCCGCCCACCGCGGGGATGGGCGGAGCTACAAGTACCTGATAAAAACCAGTTAGTTAGCGACGCGGGTAAACTGACAGTACTAGACTCGTTATTGAAAAGACTGAAGGAGAGCGGGCATCGAGTCCTTATATACAGTCAGATGACGAAAATGATTGACTTGCTAGAGGAATATATGTGGCACAGGAAACACAAATACATGAGGTTGGACGGGTCTAGCAAAATATCTGCAAGACGCGACATGGTCGCTGATTTTCAGGCTCGCGCAGATATATTCGTGTTCCTGCTCTCGACTCGCGCCGGCGGTCTCGGTATAAATCTAACAGCTGCCGATACTGTCATATTCTACGATTCTGATTGGAATCCAACGGTGGACCAACAAGCTATGGACCGGGCTCACAGACTGGGTCAGACTAAACAGGTCACTGTGTACAGACTTATATGTAAAGGGACTATCGAAGAGAGAATCATGCAACGAGCTAGGGAGAAGAATAAAAACCAGTTAGTTAGCGACGCGGGTAAACTGACAGTACTAGACTCGTTATTGAAAAGACTGAAGGAGAGCGGACATCGAGTCCTTATATACAGTCAGATGACGAAAATGATTGACTTGCTAGAGGAATATATGTGGCACAGGAAACATAAATATATGAGGTTGGACGGGTCTAGCAAAATATCTGCAAGGCGCGACATGGTCGCTGATTTTCAGGCTCGTGCAGATATCTTCGTGTTCCTGCTATCGACTCGCGCCGGCGGTCTCGGTATAAATCTAACAGCGGCCGATACTGTCATATTCTACGATT
CTGATTGGAATCCAACGGTGGACCAACAAGCTATGGACCGGGCTCACAGACTGGGTCAGACTAAACAGGTCACTGTATACAGACTTATATGTAAAGGGACTATCGAAGAGAGAATCATGCAACGAGCTAGGGAGAAGAGTGAGATTCAAAGGATGGTGATCAGCGGTGGTAACTTCAAACCGGACACGTTGAAGCCGAAGGAGGTCGTGTCGCTTCTATTGGACGATGAGGAGATTGAATTGAAATATCGTCAGAAATCCGAGGAAAAGAAAAACGAAGAAAAGGAACGCAAAAGGAAGATGGGAGTGTTGCCAGTGCCATCATCCGTGGAGTCGAAGCGTCCTCGTGAGTCGTGTTCGGAGCCCGGAAGTCCGTTACAGGTCGACGATGACAGCGACACCCTCGTTATGGATGACGCACATCCGCTACCGTATTCCCCTCCGGTCCGGGGTAGTTGGTCGTCTCGTTGCCGGCGCGCGGGGTCTCGGCGCGGGCGACCGCGTGGCTCCCGGCACGTGCCCCGGGACAGAAGACCGGACCCGCCCGCTGATAACGCTCAAGCCGTTGCCGCTGGCGCTGCACTGCTGGAATCCGACGCTCCTTTAGAAGCCTTGGAGCCCCCATCATGTGATCGGGGTCGCCGCGGCCCGGGCCGGCCTCGCCTAAGGACCTGTAGGCCTCCAAGACCAGTCACCAGGAGACGCAGGGCCCGTGAGCCGCTCCTAGTACCATTAGCACCTCCACCGTGA

Protein sequence:

>DPOGS212047-PA
MSEKSLVGKTALKRHEVARPIHIQRLEAALDIRPFVNQVEGILNSDGDSDSEEDSDGKSMANEMSKGDKDMHGVVTTRQERQSDQLRLYNLSSVGEERQWLRDVLLSSESESSSEDDTPAAKERRIKLLLKERYFHNKYAKSYYKDSGNSRYMYYGAGLLSTIDRYPERRISRPLPPARGRRGRPAERGSSRGLQRNRRGRAKEGRTDGSDAPDGVDWEEQLRNLKEENDTDEFTISEKPGPRGRKKLSSLKSPEALTARRRRHWQLLVKKELGKVQRSRTATHRELMLQRKRLATLCCKHWRHVAMQSQKNMKETVWRCKRLSREMQAYWRRYDRAERETRRRLEREAEEQRKMDVELMEAKRQRRKLNFLITQTELYAHFMQRKLNATEDVDDDTDRILMQLDEDRDPRLSAIDNYDSEAMKELASRNAREAFQAERARTSAPEGTDEKERRRDHDQPEIFRGTLKGYQLKGMNWLANLYDQGISGILADEMGLGKTVQCIAFLCHVAERLGVWGPFLVVSPASTLHNWQQEMQRFVPDFKVVPYWGSPSERKILRQFWERKDLHTPQAAFHVVVTSYQIVVSDLKYLNRVSWQYMILDEAQAIKSSASMRWKLLLGFSCRNRLLLSGTPIQNSMAELWALLHFIMPTLFDSHDEFNEWFSKDIESHAENKTTIDEKHLSRLHMILKPFMLRRIKKDVENELSDKIEIMVHCPLTIRQKLLYIALKKKIKIEELLHYSVGGESGHSVDKNFTSNLMNLVMQFRKVCNHPELFERRDVRSPFAMQVDDYHLPKLLAEECILVRSIPSKRHLLYNKLSVLNPEYVHHNTESFSFMRFMDLSPMDMYRIMLCGWLYTMLHIEECLDKYEKLRHRQFWWRRIEEAGVKEEQSTEDFGDYFDRVHLKHLLLISPNGDTLKRRKEDSLLWREFLFTDPLWGEGGPFYIHTEQTIHYMPETVEHRGLRTRNMKCEAQPEMLSEIKTEEGGVVPVEPRAAEAVEFPHTERAPRVMEHLQTHIPAFLCTAQQKASAHCREAFASSRSWAHSQERHRRGENDEGAALLRRLVDAARPPRGWAELQVPDKNQLVSDAGKLTVLDSLLKRLKESGHRVLIYSQMTKMIDLLEEYMWHRKHKYMRLDGSSKISARRDMVADFQARADIFVFLLSTRAGGLGINLTAADTVIFYDSDWNPTVDQQAMDRAHRLGQTKQVTVYRLICKGTIEERIMQRAREKNKNQLVSDAGKLTVLDSLLKRLKESGHRVLIYSQMTKMIDLLEEYMWHRKHKYMRLDGSSKISARRDMVADFQARADIFVFLLSTRAGGLGINLTAADTVIFYDSDWNPTVDQQAMDRAHRLGQTKQVTVYRLICKGTIEERIMQRAREKSEIQRMVISGGNFKPDTLKPKEVVSLLLDDEEIELKYRQKSEEKKNEEKERKRKMGVLPVPSSVESKRPRESCSEPGSPLQVDDDSDTLVMDDAHPLPYSPPVRGSWSSRCRRAGSRRGRPRGSRHVPRDRRPDPPADNAQAVAAGAALLESDAPLEALEPPSCDRGRRGPGRPRLRTCRPPRPVTRRRRAREPLLVPLAPPP-
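
The lengths and coordinates reported in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the database: the function names are hypothetical, and it assumes that the transcript is a coding sequence only (it begins with ATG and ends with the stop codon TGA, shown as "-" in the protein) and that the genomic coordinates are 1-based and inclusive.

```python
def codon_count(cds_bp: int) -> int:
    """Number of codons in a coding sequence; the length must be a multiple of 3."""
    codons, remainder = divmod(cds_bp, 3)
    if remainder != 0:
        raise ValueError(f"CDS length {cds_bp} is not a multiple of 3")
    return codons


def cds_matches_protein(cds_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """True if a CDS of cds_bp bases encodes protein_aa residues (plus one stop codon if has_stop)."""
    return codon_count(cds_bp) == protein_aa + (1 if has_stop else 0)


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values taken from the record: 4749 bp transcript, 1582 aa protein,
# genomic position 512722-526879 on scaffold DPSCF300054.
print(cds_matches_protein(4749, 1582))   # 1583 codons = 1582 residues + stop
print(genomic_span(512722, 526879))      # genomic span in bp, under the assumption above
```

Under these assumptions the transcript and protein lengths agree, and the genomic span is considerably longer than the transcript, which is what one would expect for a gene model with introns.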