Monarch geneset OGS2.0

DPOGS208839
TranscriptDPOGS208839-TA1014 bp
ProteinDPOGS208839-PA337 aa
Genomic positionDPSCF300036 + 806349-808260
RNAseq coverage644x (Rank: top 20%)
Annotation
HeliconiusHMEL0154328e-12390.20% 
BombyxBGIBMGA007945-TA4e-11684.34% 
DrosophilaCG1622-PA2e-9279.02% 
EBI UniRef50UniRef50_B0WBK22e-9164.92%PRP38 pre-mRNA processing factor 38 domain containing B n=8 Tax=Bilateria RepID=B0WBK2_CULQU
NCBI RefSeqXP_001846086.14e-9264.92%PRP38 pre-mRNA processing factor 38 domain containing B [Culex quinquefasciatus]
NCBI nr blastpgi|3016051485e-9156.97%PREDICTED: hypothetical protein LOC100170167 [Xenopus (Silurana) tropicalis]
NCBI nr blastxgi|1700364684e-11262.50%PRP38 pre-mRNA processing factor 38 domain containing B [Culex quinquefasciatus]
Group
KEGG pathwaycqu:CpipJ_CPIJ0045001e-91 
 K12850 (PRPF38B)maps-> Spliceosome
InterPro domain[17-337] IPR0050374.3e-121PRP38
Orthology groupMCL13775 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208839-TA
ATGAGTGAAATTGAAGATTACAATCAACAGAAACCAGGAAAAACAAGCAAGCAGCATAACATATTACCAATATGGGGTAATGAGCAGACTATGAATCTGAATCCTTTAATATTAGCGAATATTCAAGGTTCTAGCTACTTCAAAGTGCATCTGTTCAAGTTGAAAACATATCACGAGGTTGTTGACGAGATCTACTACCAGGTGAAACACCTCGAGCCTTGGGAGCGTGGAAGCCGGAAGACTGCCGGCCAGACGGGCATGTGTGGCGGCGTCAGAGGGGTAGGTGCAGGGGGCATTGTTTCTACAGCTTTCTGTCTTCTCTACAAGTTATATACACTGCGGCTAACTCGCAAGCAAGTCAATGGTCTTCTGCAGCACACAGATTCGCCATACATTAGAGCACTTGGTTTTATGTACATACGCTACACACAGCCTCCAGCTGACTTGTTTGACTGGTATGTCGACTACCTGGACGATGAAGAGGAGGTTGACCCCCGTGCTGGAGGTGGGGGCTCTACTACCATAGGCGCTCTTGTGAGACAAATGCTTATCAAGTTGGACTGGTTCAGCACCCTGTTCCCCAGGATACCTGTACCCATCCAAAAGCAGATAGAACAGAAGTTAGCAGAGCATAATAGACAGAGCAATGCCAGCAAGCCCAGCAACTACCGGGGCGCTATAGGCAACGGTAATAATAGTAGTGCTAGTAGCGCCGGTAACTACAACACGGGCCGCACGGAGTCAGACCGCCGCGAGCACGACGACAGGGACAGACGTGATTATGGAGATGCTGAGAAATATTCAAAAGATAGACCAGTGAGACGCGACGACAGGGATCGGGAGCGTGAGAGAGACAGGGATCGTGAACGAGATCGCGATCGTGATAAGGAGAGGCGGGATCGTGATCGGGTTCGTGAACGTCACCGTGACCGATCACGTTCACGAGACCGTCGTGATCGGTCCCGCGAGCGACCACGTCACAGGTCACGCTCGAGAGATCGCCGCCACAGATAA

Protein sequence:

>DPOGS208839-PA
MSEIEDYNQQKPGKTSKQHNILPIWGNEQTMNLNPLILANIQGSSYFKVHLFKLKTYHEVVDEIYYQVKHLEPWERGSRKTAGQTGMCGGVRGVGAGGIVSTAFCLLYKLYTLRLTRKQVNGLLQHTDSPYIRALGFMYIRYTQPPADLFDWYVDYLDDEEEVDPRAGGGGSTTIGALVRQMLIKLDWFSTLFPRIPVPIQKQIEQKLAEHNRQSNASKPSNYRGAIGNGNNSSASSAGNYNTGRTESDRREHDDRDRRDYGDAEKYSKDRPVRRDDRDRERERDRDRERDRDRDKERRDRDRVRERHRDRSRSRDRRDRSRERPRHRSRSRDRRHR-