Monarch geneset OGS2.0

DPOGS202835
TranscriptDPOGS202835-TA4074 bp
ProteinDPOGS202835-PA1357 aa
Genomic positionDPSCF300018 + 888067-894218
RNAseq coverage113x (Rank: top 59%)
Annotation
HeliconiusHMEL0062930.042.41% 
BombyxBGIBMGA010508-TA6e-7646.26% 
DrosophilaCG18596-PA2e-3149.24% 
EBI UniRef50UniRef50_UPI00022C9DF41e-4522.68%UPI00022C9DF4 related cluster n=1 Tax=unknown RepID=UPI00022C9DF4
NCBI RefSeqXP_001664217.17e-3933.52%hypothetical protein AaeL_AAEL013974 [Aedes aegypti]
NCBI nr blastpgi|3504150735e-4522.68%PREDICTED: hypothetical protein LOC100745300 [Bombus impatiens]
NCBI nr blastxgi|3485072844e-4125.13%PREDICTED: probable methyltransferase TARBP1-like [Oreochromis niloticus]
Group
Gene OntologyGO:00063967.6e-18RNA processing
GO:00037237.6e-18RNA binding
GO:00081737.6e-18RNA methyltransferase activity
KEGG pathway 
InterPro domain[1227-1348] IPR0015377.6e-18tRNA/rRNA methyltransferase, SpoU
Orthology groupMCL17535 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202835-TA
ATGTATAAAGAAGGAGAGTTATTATCTTTCTTGGATTTGCTAGATTTAGATGAAGAGGTTATAGACAGAAGAGCAAAAAGTATCATGCAACGAAGTACTTTAAGCAGTCAGCACTTAGAACATTTTATTTACCTTCTGCAATACAAGCATTTGATCAACATCAGAGAGAATAGTGAATGCGAAAATGAGGAGGAATTCAACTTTCTTAGTAAACTAATTGTCGACGCAGACAATGAGAATGCGAACGCAACATGTACACTTGTAAATTTAGTATTAACATTAAATCCTTCCACCATAATAAATAAATCCGAGCATCTCTTGCAACAAATATTATCAACATTAAATTTCCCTCCTATTAAGAGGAGCATCATTGAAGATGGTGGTTCAACAAGTAGAGAGACAGAAGACACACTAGTTAAACTAAGGATTTGTGGTAGTGTATTAGATGCTGTGAAGAAAGTTGGAGCTAGGCTAGAGCAAATTATTTGCACTGAAAATAAACAACTACAAGGATTCTTCCTTATGAACACTGTTCCCAAATTTTTTGAGACCGTTGATGATTGTAATATTCTTGATAGAATTTGGAGTTTCGTAAAACAACTAAATGATAGTGCAAATGCTTTAAAAGTTTTATGTAGCCTTTCAAACTATTATTTACCAGTAATTGAAGATGAATCTGAAAATATTTCAATCGAATCTACTATCATAATGGATTCCGAGTTTTGGAATTTTATTCTGAGTGGATTGCTCTGTGAAGCTCAATGTTTATCTTTAAGAAAGCAGGCCATATACCTCGCTAAGAGAGCCATAGACTATGCTGCATATAGGAAAAAAGATATTAAAATCACCTCACACAGCACATTCATATGGTCTAAAGAAAATGAAAAAACATTAAGAAAAATGTGGGAAAACTATTTTATATTAATAGATAGTTTACAAGAGAAACAAAGCAATATTGTTCTACCGTCATTAAAACTTTTTAATAGCATCAACGACATAGGCCACTGTTGGTTGGCGGCCGCTTTTAATATTGGCCTGAAACATGATAATTTCCAAGTGAGACAAAAATGCTTGATCTATAGGTTTCAGTATAGAATAAGTAATCAAATGGAGGCTGTAGGTGTTCTAGAAGCTCTTAACGACCCAGCAATATTTGATAATAGAATTAGACATACCTCAGTTATTACACTACTGAAGGAAGCCCTAACAAATGTTCGAACACTTATAAATTTTCTCCAGGGAATGGCGGATGTCAAGTTGTCACCAGTGCCACTGTTCTCTTTGACATCCTGCTTAGCTGATCTCCATATATTGCTTAGCGATTCTGATCAGAAAACTTTATTAAAAACTATACACAAGATATTAAACACTCCATGCAATAACACAGTGCTAAGAAAGGCCATATGTGTTAACATGTCACACTTTATTGCAAATTGCTGTAAGAATTTACATTGGAAGGAGTATTTATTGTTGTATCCATTACTGGCTGTTGAATGCAACCAGGAATTTATGAATCCATTTGTTAGTTTCATACAGAATGATATGGATGTACCTGAGGATGAGATGGTGCAGTTCATAAATAGTGGAAAGAAATCGCATCTTAATATACAATATGCCCTAATTTACTTCCGAGGTCAAGATAAGCCCTTGTTTCTGGAGATGATCGATGATATGATACATCAAGTCAAGGACACCAGCAGTAGACAGTATTCAAACAAGTTAGACTGTTTAGACGAGGCCATCTTCATCTCACATTTGTATAATTGTCGAGACAATAACACAATAACTCATATATATTACAATTCTGAAGCATTCCAAGCTATAATGCAATATATTGGTAGTTTATTGTCAGATGAAATTAATTTGGATTTTGATAAAATGAATTTACTCTTAGAAGGTTTCGATTATGCATTAAGAGCTGTGAATATTATTAATTATAAGGAAAACTTAATACAGTTATATAAATCAGCTGAGTTTCTCCTAAAAGATATAAACTCTGATCTGCAGAAAAAACTTCTTGCATTACTCACTCTCAATACACTTATGAAATCGAAATTTCGGGACGTTTGTGACGAAGGTCCGGTTATTGAAGCATTTGTAAACATCATTAAGAATATTAATTATTCTGGACAAAAACGAGAAGATGGCGGCAGATTGAGGAACAGTTTTTATGAAAATATATGTCATTTATTGTGTACAGTGTATACAAGTGGATCTATAAAAGATATTATATATTTTATTGATACAGTAGTGGAATGTGGAGGTCACGGCTGTTTGAAATGGTTATTGAATTTAACTAATAAAATAATAACGGAACTGTTAGAGCAAGATAGTGTCAAATTTGATTTAATACAGTTTCTGAATAGAATGTGGAAAGAGATCGAAGACCTGAAGTCAAACAGCCAATATTCAATTTGTATGGAACACTTTATAAATCTGCTCACCCACGATGCTGTGTTAAAGAGACCAATATATAATAACCTCGTTATCTCATACTGCAATAAGATTATTAACTATGCCACATTAAAGTATTCCCCGCTGTATTTTCTTATAAGACGACTGACTCTGATTGATATCTCGTCTTATGGCCACATGATTTATATTTTATGTGAAATTTTGCTATACGCAAACATTCCGAACAAAGAACAAAGGATAGCTGAAAACTTACAAGTGACAATATTAGAGCGATCAGATTTTTTTGGAATAAATGAAGAATGCGTCCAGTTTAATTGTCACATACAATATTTGGCAGTTTCAGTACTTGTTAAGATAAGGGACAATGAAATTCTTGATACCGTGGTGAGGTCAGTGAGGAGAAGGATTGATGAATTGCTGAAAAATAAGCTTCGGTACCACGAGAATTCCTACCTCGAGAGGTCTATAGAATCTTGTTTACAATGTTTGCTCTTCATAGCTCTTATGACCGAGGAGGTCGATTTGAAGGACAGTGCTGTGTGGTGTATGGAGTTATTGGGAAGGATGCCACACCAACCGTATGTCAAGATATGTCTGGAATGGTTTATTTGCTTATACTATTATTTTGAGGCGGAGTTCGATCACGTCATGGAGTTTTTAACCTCACACAGCATGGGCCCGGTCTACGGCGTGCGGCTGAACGCACAGTATCTGGCCACGAAGATTGTTGACATCAATGACAAACAGAAATACAAATTGGATGACAAACAATTCACATACATTATAAGGGTTATAAGGAATTGTCTGCGTCAAGCCCAGGAGTTGGAAGAGAAGAGTCTCATGAAGTTATCGAACAGCTGTTTCATCAATTCGTTTGACATCGTCCAAAGCTTGAACTTCTTCGACGTGTTCTATAAACTGCCCCTGTCGTTAACACACACCGCCAGACATTTCGATATGACCAATAAATTTCTTCAAACTGTGACCATGGACATTGAAGCGTGTTTGGAAACAGGTTTGAGGGGTGAACTCCTGAGTGGTCGATTCGTCGTTCCAGAGAAGATCCGATGTGAGTTTTTAACAGATGTCATTGATGACGAGGAACCTGAGATTATACAAAAAAAATACGTCCCCTGGAACGGTATGAGCGACGTGGACGCTCACAGCGAACGTCATGTAAGAGTTATTATTCATATTAATCAGATTCAAACAGTTGCATCAACAGCCCTCGTAGTTGTAACAGCACCCGACGATGAAACGGGGGAGGTGTTCGGAGTACACACGTACGTCATGGACAGCCTGAGACATCTACAGAACAGAATGTTCCAGGACCTGAGCGTGTCAGCTGAGCGGTGGCTGAACGTGGAAGAGGTCCGCCCGGGAGAGCCTCTCAAGCGATATCTCATGACGAAGAAGGCCGAGGGACACATCGTCGTAGCTGCGGAACAGACTTCTAATAGCGTCAAACTGCAACATTTCAAATTCCCCAAGAAAACCATCTTGATGCTGGGACACGAGAAGGAGGGCGTCTCGTGTGAGCTGCTGCCTCTGTGCGACGCGTGCGTGGAGGTCCCTCAGCGAGGAGTGGTCCGCTCTCTCAACGTGCACGTCACCGCCGCCCTCTTCGTGTGGGAGTACACGCGCCAGCACCTCCTCTAG

Protein sequence:

>DPOGS202835-PA
MYKEGELLSFLDLLDLDEEVIDRRAKSIMQRSTLSSQHLEHFIYLLQYKHLINIRENSECENEEEFNFLSKLIVDADNENANATCTLVNLVLTLNPSTIINKSEHLLQQILSTLNFPPIKRSIIEDGGSTSRETEDTLVKLRICGSVLDAVKKVGARLEQIICTENKQLQGFFLMNTVPKFFETVDDCNILDRIWSFVKQLNDSANALKVLCSLSNYYLPVIEDESENISIESTIIMDSEFWNFILSGLLCEAQCLSLRKQAIYLAKRAIDYAAYRKKDIKITSHSTFIWSKENEKTLRKMWENYFILIDSLQEKQSNIVLPSLKLFNSINDIGHCWLAAAFNIGLKHDNFQVRQKCLIYRFQYRISNQMEAVGVLEALNDPAIFDNRIRHTSVITLLKEALTNVRTLINFLQGMADVKLSPVPLFSLTSCLADLHILLSDSDQKTLLKTIHKILNTPCNNTVLRKAICVNMSHFIANCCKNLHWKEYLLLYPLLAVECNQEFMNPFVSFIQNDMDVPEDEMVQFINSGKKSHLNIQYALIYFRGQDKPLFLEMIDDMIHQVKDTSSRQYSNKLDCLDEAIFISHLYNCRDNNTITHIYYNSEAFQAIMQYIGSLLSDEINLDFDKMNLLLEGFDYALRAVNIINYKENLIQLYKSAEFLLKDINSDLQKKLLALLTLNTLMKSKFRDVCDEGPVIEAFVNIIKNINYSGQKREDGGRLRNSFYENICHLLCTVYTSGSIKDIIYFIDTVVECGGHGCLKWLLNLTNKIITELLEQDSVKFDLIQFLNRMWKEIEDLKSNSQYSICMEHFINLLTHDAVLKRPIYNNLVISYCNKIINYATLKYSPLYFLIRRLTLIDISSYGHMIYILCEILLYANIPNKEQRIAENLQVTILERSDFFGINEECVQFNCHIQYLAVSVLVKIRDNEILDTVVRSVRRRIDELLKNKLRYHENSYLERSIESCLQCLLFIALMTEEVDLKDSAVWCMELLGRMPHQPYVKICLEWFICLYYYFEAEFDHVMEFLTSHSMGPVYGVRLNAQYLATKIVDINDKQKYKLDDKQFTYIIRVIRNCLRQAQELEEKSLMKLSNSCFINSFDIVQSLNFFDVFYKLPLSLTHTARHFDMTNKFLQTVTMDIEACLETGLRGELLSGRFVVPEKIRCEFLTDVIDDEEPEIIQKKYVPWNGMSDVDAHSERHVRVIIHINQIQTVASTALVVVTAPDDETGEVFGVHTYVMDSLRHLQNRMFQDLSVSAERWLNVEEVRPGEPLKRYLMTKKAEGHIVVAAEQTSNSVKLQHFKFPKKTILMLGHEKEGVSCELLPLCDACVEVPQRGVVRSLNVHVTAALFVWEYTRQHLL-