Monarch geneset OGS2.0

DPOGS200600
TranscriptDPOGS200600-TA1728 bp
ProteinDPOGS200600-PA575 aa
Genomic positionDPSCF300076 - 512696-517315
RNAseq coverage761x (Rank: top 17%)
Annotation
HeliconiusHMEL0010190.079.55% 
BombyxBGIBMGA011316-TA0.074.87% 
DrosophilaSlu7-PA9e-17569.28% 
EBI UniRef50UniRef50_B0WUI70.060.03%Pre-mRNA-splicing factor SLU7 n=1 Tax=Culex quinquefasciatus RepID=B0WUI7_CULQU
NCBI RefSeqXP_974637.10.062.80%PREDICTED: similar to step ii splicing factor slu7 [Tribolium castaneum]
NCBI nr blastpgi|2294874120.079.55%unnamed protein product [Heliconius melpomene]
NCBI nr blastxgi|2294874120.082.56%unnamed protein product [Heliconius melpomene]
Group
KEGG pathwaytca:6635040.0 
 K12819 (SLU7)maps-> Spliceosome
InterPro domain[160-406] IPR0217151.7e-81Pre-mRNA splicing Prp18-interacting factor
Orthology groupMCL11936 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200600-TA
ATGACAACTGGTACCCGAGTGGCGGTCTCACAAATTTTGCGAAATAAAGACGACACTGGTGACGAAGAGGATGAACCTAAAAAGAAATCTCGTGAGGATTGGAGAAAGGCTAAAGAGTTGGAGGAGGCTCGAAAAGCCGGAACAGCCCCAGCTGCTGTTGATGAAACAGGCAAGGACATAAACCCTCACATACCTCAGTACATTTCCTCTGCACCTTGGTATTATGGTACAGCCGGTCCAACTCTTAAACATCAAAGGCCTCAAGAGGATAGAGAAGCAGAATTCACTAAACTTGACACATATTACAATAAAGGAGTCACTGGTTATGCCGCCACCAAGTATAGAAAGGGAGCCTGCGAGAACTGTGGTGCGATGACGCACAAGAAAAAGGATTGCTTGGACAGACCGAGAAAGATTGGGGCAAAATTCACTAATGTCGGCATAGCTCCGGACGAGTTCAGTCAACCGAATCTCAATCTCAGTTACGATGGCAAAAGGGATAGGTGGAACGGATACAATCCAGAACAACACAAGGCCATTATAGAGGAATATCAGAAAGTTGAAGACGCCAAGAGAGAATTACGAGCGGAAAAACTAGAACAAGACCCGACAGCGACGGAGGAAGATGATCGGGAGGGTGAGGACGAGGATAAATATGTTGATGAAGTGGACATGCCGGGGACTAAGGTGGATTCAAAACAACGAATCACAGTCCGTAATCTCCGTATTCGTGAGGACACAGCCAAATACCTCCGGAACCTGGACATCAACTCAGCCTACTACGATCCGAAGACACGTTCCATGAGGGACAATCCAAATCCGAATGGTGATGAGTCAGAGTACGCCGGTGAAAACTTCGTCCGTTTCAGCGGAGACACTCGCTCCCATGCAAGTGCCCAGTTGTTCGCATGGGACGCTCATCACAGAGGTCTGGATGTACACCTGCTGGCTGAACCCACCAAGTTACAGCTGCTCCAGAAAGAGTACGACGCTAAAAAGGAACAGTTCAAAACACAGGTGAAACAATCCGTGCTAGACAAGTATGGTGGTGAAGAGCATCTGAAGGCTCCTCCGAAGGAACTGCTCTTGGCTCAGAGCGAGGTGTTCCTCAGATACAACAGGGACGGCACCCTGGCTTCAGATGTTGAGAAACAACTGGCCAAGAGCAAATATGAAGAAGATGTTCTGATCAACAATCACACAAGTGTCTGGGGTTCCTACTGGAGGGATGGACAGTGGGGATATAAGTGTTGCCACTCCTTTATAAAGATGTCTTACTGTGTCGGCGAAGCGGGAAAATCTGTTGTATCTGGACACGTGGCGGATGTTCAAGATCCAGATAAGAGCTTGGGAAGTAAAAAAGACAAAGAAGACAAAATTGTAAAATCTGCATCTGAATCGGACTCGGAGAGCAGTTCGTCTTCATCAGAAGAAGAAATAAGAAGTAAAACAGAGAAAACCAGCAAAAAGAAGAAAAACAAGAAGAAAGCTAAGAAGAAGAAGAAGACGGAAAAGAAAGAAGACAAAAACGAAGATAAGTTAAAGAAGGCTCTTGAGATGGAAGAACGCAATCAGCGTAACGCTGATCGTCTGCTATCAATGGATGAAAGGAAACGTCCCTATAACAGCATGTTCGATGTGAAGGAGCCGAGCCAGGACGAGCTGGAAGCTTACATGATGAAGAGGAAGCGAGACGAAGATCCGATGTTACAGTTCATGAATAATTAA

Protein sequence:

>DPOGS200600-PA
MTTGTRVAVSQILRNKDDTGDEEDEPKKKSREDWRKAKELEEARKAGTAPAAVDETGKDINPHIPQYISSAPWYYGTAGPTLKHQRPQEDREAEFTKLDTYYNKGVTGYAATKYRKGACENCGAMTHKKKDCLDRPRKIGAKFTNVGIAPDEFSQPNLNLSYDGKRDRWNGYNPEQHKAIIEEYQKVEDAKRELRAEKLEQDPTATEEDDREGEDEDKYVDEVDMPGTKVDSKQRITVRNLRIREDTAKYLRNLDINSAYYDPKTRSMRDNPNPNGDESEYAGENFVRFSGDTRSHASAQLFAWDAHHRGLDVHLLAEPTKLQLLQKEYDAKKEQFKTQVKQSVLDKYGGEEHLKAPPKELLLAQSEVFLRYNRDGTLASDVEKQLAKSKYEEDVLINNHTSVWGSYWRDGQWGYKCCHSFIKMSYCVGEAGKSVVSGHVADVQDPDKSLGSKKDKEDKIVKSASESDSESSSSSSEEEIRSKTEKTSKKKKNKKKAKKKKKTEKKEDKNEDKLKKALEMEERNQRNADRLLSMDERKRPYNSMFDVKEPSQDELEAYMMKRKRDEDPMLQFMNN-