Monarch geneset OGS2.0

DPOGS200625
TranscriptDPOGS200625-TA2871 bp
ProteinDPOGS200625-PA956 aa
Genomic positionDPSCF300076 + 221992-230870
RNAseq coverage348x (Rank: top 34%)
Annotation
HeliconiusHMEL0031430.084.91% 
BombyxBGIBMGA008911-TA0.088.66% 
DrosophilaCG6841-PA0.075.60% 
EBI UniRef50UniRef50_B4KZ760.068.33%GI13491 n=2 Tax=Coelomata RepID=B4KZ76_DROMO
NCBI RefSeqXP_623891.20.078.70%PREDICTED: similar to CG6841-PA [Apis mellifera]
NCBI nr blastpgi|3838498720.078.70%PREDICTED: pre-mRNA-processing factor 6-like [Megachile rotundata]
NCBI nr blastxgi|3838498720.078.70%PREDICTED: pre-mRNA-processing factor 6-like [Megachile rotundata]
Group
Gene OntologyGO:00056347e-43nucleus
GO:00003987e-43nuclear mRNA splicing, via spliceosome
GO:00054884.3e-25binding
KEGG pathwayame:5514930.0 
 K12855 (PRPF6, PRP6)maps-> Spliceosome
InterPro domain[21-166] IPR0104917e-43PRP1 splicing factor, N-terminal
[517-787] IPR0119904.3e-25Tetratricopeptide-like helical
Orthology groupMCL12929 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200625-TA
ATGTCAGTTCCTCCGCAAGCATTTGTAAACAAAAACAAAAAACATTTTCTTGGTATTCCGGCACCTCTTGGTTATGTGGCTGGTGTTGGTAGAGGAGCTACAGGTTTTACTACTAGATCAGATATTGGACCCGCCAGAGACGCCAATGACGTATCTGATGATCGTCATGCACCCCCAGCAGCCAAGCGAAAAAAAACTGAAGAGGAAGACGATGATGAAGACTTGAATGACTCTAATTATGACGAATTTTCAGGTTATAGTGGCTCTCTCTTTTCAAAGGATCCATATGACAAAGATGATGCAGAGGCAGATGCTATATATGAGTCAATTGATAAACGAATGGATGAAAAAAGAAAAGAGTATAGAGAGAAGAGACTTAAAGAGGATTTGGAGAGATATCGCCAAGAGAGGCAATTTTCTGATCTCAAACGGGAATTGAAAATGGTGTCGGAGGATGAATGGGCTGCTATACCGGAAGTTGGTGACGCGAGGAACAGGAAGCAAAGGAATCCAAGAGCGGAGAAATTTACTCCTTTACCAGATAGTGTGTTATCTAGGAATCTTGGTGGAGAGTCTAGTTCAACAATTGATCCTAGTTCAGGCTTAGCTTCTATGATGCCGGGGGTTATGACACCTGGAATGCTGACACCTTCAGGTGATCTCGATCTACGTAAGATAGGTCAAGCGAGGAACACTTTAATGACGGTGAAATTGTCACAAGTCTCCGACTCTGTGAGCGGTCAGACAGTGGTGGACCCTAAAGGTTACTTAACTGACTTACAGTCCATGATACCTACCTATGGCGGTGACATTAATGACATCAAAAAGGCCAGGCTGCTCCTCAAGTCGGTGAGGGAAACCAATCCTAACCATCCACCAGCTTGGATTGCTAGTGCTAGATTAGAAGAAGTTACTGGTAAAATCCAGTCAGCCCGCAACCTCATAATGAAGGGTTGTGAGGTTAACCCCAGCAGTGAAGAGCTCTGGTTGGAAGCGGCTCGTCTACAACCACCGGATACAGCTCGGGCGGTTATAGCACACGCCGCCCGCAACCTGCCTCATAGTGTACGAGTTTGGGTGAAGGCGGCTGAACTGGAACAAGAACCAAAGGCTAAACGTCGTGTTTACAGAAAGGCGTTGGAGCATATACCAAATTCAGTGCGTTTGTGGAAAGCGGCCGTCGAATTGGAGAACCCTGAAGATGCTAGGATCCTGCTTTCAAGGGCCGTGGAGTGTTGTCCGACGAGCGTAGAACTATGGCTGGCTCTGGCTAGACTGGAAACATATGAAAATGCAAGAAAAGTACTAAATAAGGCACGTGAAAATATTCCCACCGATAGACAGATCTGGGTAACAGCTGCTAAACTTGAAGAGGCTCAAGGCAACACTCATATGGTAGAAAAGATTATAGACCGTGCCATAACGTCGCTTAGTGCTAATGGCGTTGAAATAAACAGAGAGCATTGGTTCAAAGAGGCGATGGAGGCTGAGAAATCTGGAGCAGTTCATACGTGTCAGGTGATCGGTCACGGCATTGAACCAGAGGATCAAAAACATACTTGGATGGAGGATGCTGATGCTTGCGCCAACGAAGGTGCGTACGAGTGTGCCCGGGCGGTGTATGGGTACGCGCTATCAGTTTTCCCCTCGAAGAAGTCCATCTGGCTGAGAGCCGCCTACCTCGAGAAGCAGCATGGTACGAGGGCGACGTTGGAGGCTCTGTTACAGAGGGCGGTCGCTCACTGTCCCAAGAGCGAAGTCCTATGGCTCATGGGGGCGAAGTCCAAGTGGCTAGCGGGTGACGTGAGAGCGGCTAGACAGATCCTGTCGTTAGCTTTCCAAGCCAATCCTAACTCGGAGGAGATCTGGCTGGCCGCTGTCAAACTGGAGAGCGAGAACAAAGAATATGATCGAGCCAGGAGGTTGTTGGAGAAAGCCAGAGCGTCCGCACCCACACCTAGGGTCATGATAAAATCAGCAAAACTAGAATGGGCTTTGAACAAATTAGACGTAGCCCTGAACCTGCTGTCAGAAGCTATCACAATATTTGGGGATTACGCGAAGCTACACATGATGAAAGGACAGATAGAGGAGCAGATGGGGAGGGATAGTGACGCACACAACACGTACACACAAGGGTTGAAGAAGTGCGCTACCAGTGTCCCTATGTGGATACTGCTGTCGAGATTGGAAGAAAAACTCAAACACGTCACCAAAGCCAGATCTGTGTTGGAGAAGGCGCGTCTCAGGAATCAGAAGAACGCTGAGTTATGGTTGGAGAGTGTTCGCCTGGAACAGCGAGCTGGTTGTGTGGAAGCGGCCGGCTCCTTGTTGGCGAAGGCGCTCCAGGAGTGTCCTACGGCCGGCAGACTGTGGGCCCTCGCCGTCTTCATGGAGCCCCGCCCGCAGAGGAAGACTAAGAGTGTGGATGCCCTGAAGAAATGTGAACACGACGCTCACGTCCTGCTGGCGGTGTCGCAGCTGTTCTGGACGGAGAGGAAATTAAATAAATGCAGGGAATGGTTCAACAGAACTGTGGATGCTCTGAAGAAATGTGAACACGACGCTCACGTCCTGCTGGCGGTGTCGCAGCTGTTCTGGACGGAGAGGAAATTAAATAAATGCAGAGAATGGTTCAACAGAACTGTGAAAATCGACCCGGATCTCGGTGACGCTTGGGCTTACTTCTACAAATTCGAATTGCACCACGGCAACGAACAGCAACAGGAAGACGTGAAGAACAGGTGCAAGGCCGCCGAACCCCACCACGGAGAGAACTGGTGCAAGGTCTCCAAAGACATAGCCAACTGGTGTTACAATACAGAACAGATATTGTTACTGGTGGCTAAGAATCTACCCGTGCCCACGTAG

Protein sequence:

>DPOGS200625-PA
MSVPPQAFVNKNKKHFLGIPAPLGYVAGVGRGATGFTTRSDIGPARDANDVSDDRHAPPAAKRKKTEEEDDDEDLNDSNYDEFSGYSGSLFSKDPYDKDDAEADAIYESIDKRMDEKRKEYREKRLKEDLERYRQERQFSDLKRELKMVSEDEWAAIPEVGDARNRKQRNPRAEKFTPLPDSVLSRNLGGESSSTIDPSSGLASMMPGVMTPGMLTPSGDLDLRKIGQARNTLMTVKLSQVSDSVSGQTVVDPKGYLTDLQSMIPTYGGDINDIKKARLLLKSVRETNPNHPPAWIASARLEEVTGKIQSARNLIMKGCEVNPSSEELWLEAARLQPPDTARAVIAHAARNLPHSVRVWVKAAELEQEPKAKRRVYRKALEHIPNSVRLWKAAVELENPEDARILLSRAVECCPTSVELWLALARLETYENARKVLNKARENIPTDRQIWVTAAKLEEAQGNTHMVEKIIDRAITSLSANGVEINREHWFKEAMEAEKSGAVHTCQVIGHGIEPEDQKHTWMEDADACANEGAYECARAVYGYALSVFPSKKSIWLRAAYLEKQHGTRATLEALLQRAVAHCPKSEVLWLMGAKSKWLAGDVRAARQILSLAFQANPNSEEIWLAAVKLESENKEYDRARRLLEKARASAPTPRVMIKSAKLEWALNKLDVALNLLSEAITIFGDYAKLHMMKGQIEEQMGRDSDAHNTYTQGLKKCATSVPMWILLSRLEEKLKHVTKARSVLEKARLRNQKNAELWLESVRLEQRAGCVEAAGSLLAKALQECPTAGRLWALAVFMEPRPQRKTKSVDALKKCEHDAHVLLAVSQLFWTERKLNKCREWFNRTVDALKKCEHDAHVLLAVSQLFWTERKLNKCREWFNRTVKIDPDLGDAWAYFYKFELHHGNEQQQEDVKNRCKAAEPHHGENWCKVSKDIANWCYNTEQILLLVAKNLPVPT-