Monarch geneset OGS2.0

DPOGS211118
TranscriptDPOGS211118-TA1287 bp
ProteinDPOGS211118-PA428 aa
Genomic positionDPSCF300007 - 483005-484291
RNAseq coverage5x (Rank: top 88%)
Annotation
HeliconiusHMEL0124252e-16668.20% 
BombyxBGIBMGA002996-TA1e-17769.53% 
DrosophilatilB-PA3e-9752.94% 
EBI UniRef50UniRef50_D6WHE03e-9752.77%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WHE0_TRICA
NCBI RefSeqXP_319679.28e-10049.23%AGAP008927-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|583928652e-9849.23%AGAP008927-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|910790721e-9849.74%PREDICTED: similar to testis specific leucine rich repeat protein [Tribolium castaneum]
Group
KEGG pathwayngr:NAEGRDRAFT_310692e-54 
 K11092 (SNRPA1)maps-> Spliceosome
Orthology groupMCL13274 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211118-TA
ATGGTTCTAATAACGGTAGAACTGGTGAGAAATAAGGCTGAACATCACGATCGACTGCTGGCTCCTCTAGAAGAAATAGCCTTACATCAGGAAAATATAGAAAAAATCGAACACATACAGGACTGGTGTCCGAAGCTCAAGATACTTTTAATGCAAAGCAATTTAATAAGTAAAATAGAAAATTTAAACAGGTTAAAACACCTCACTTACCTAAATTTAGCACTTAATAATATTGAAATTATAGAAAATTTGGAAAGATGTGAATCGCTACAGAAATTAGATCTGACACTCAACTTTATTGGGCATATTATTTCTATTGAATCCCTTGTTGGTAACTATAACTTAGAGAACCTATATTTAACTGGAAACCCGTGCACGGATTATGATAATTATCGTGAGTTTGTTGTCGGGACTCTTCCTCAACTCATGATACTCGACGGAAAAGAGATTGAAAGATCGGATCGCATCAAAGCATTACAAAACCTAAGGATCATTAGATCCGACATACTTTTTGAACAAAACAACTATCTATGTCAAAGGAAATCACAAAAGGTACGTCTCGAGAAAAACATAGCAGCAAAATGGGAAAATGAATACAAAAATATGGATCCTGATGAAAGGAATAAAAAATTTTGGGCTGAAAAATGTGAGCATGCACCTGAAGTTAGATACGAAATGGAACGGATGCGCCAACTTAAGCTAAAAAGCCTTGCGCCTGATGAAAAAAAAGAGGAAAAACGTGAATATAAATTTTTCACAACAGATGGCAGGCCATTTAATATAAACCAAGCAAGGATAGATTTTAAGTTTAGTGATACTGAGCCGGATAAGTACGTATTAGACTTGGCCATTTACAAACATTTAGACACCAGTTTATTAAACATAGATGTCCAGCCGAACTATGTGAGAGTTACACTAAAAGGAAAAATATTCCAACTCCACCTTCCGGAAGAAGTTAACACATCCGAATCTAAAGCTCAGAGGTCGCAGGTCACAGGTCATCTTATTGTAACTATGCCCAAAGCGAATATCGTCATTAAAGAGCATAAATCTCCAAAAGTAATGAATCGTGACGAAAAATCCAGAAGAAATGACAAACTGCCCGAATATAAAGGGTCAGGGCAGTCTAAAAGAGAGTATTTAGAAATCGGACCCTCTGATAATACTCTGGACTTCAGAAACATGATTACAAAGAAACCCGATTTCATTGATCCCAGAATGTCTTTAAAAGGGAAACAGCCGTCAGAAGGTTTTATAGACGATCCGCAGGTTCCAGGTCTTATTTAA

Protein sequence:

>DPOGS211118-PA
MVLITVELVRNKAEHHDRLLAPLEEIALHQENIEKIEHIQDWCPKLKILLMQSNLISKIENLNRLKHLTYLNLALNNIEIIENLERCESLQKLDLTLNFIGHIISIESLVGNYNLENLYLTGNPCTDYDNYREFVVGTLPQLMILDGKEIERSDRIKALQNLRIIRSDILFEQNNYLCQRKSQKVRLEKNIAAKWENEYKNMDPDERNKKFWAEKCEHAPEVRYEMERMRQLKLKSLAPDEKKEEKREYKFFTTDGRPFNINQARIDFKFSDTEPDKYVLDLAIYKHLDTSLLNIDVQPNYVRVTLKGKIFQLHLPEEVNTSESKAQRSQVTGHLIVTMPKANIVIKEHKSPKVMNRDEKSRRNDKLPEYKGSGQSKREYLEIGPSDNTLDFRNMITKKPDFIDPRMSLKGKQPSEGFIDDPQVPGLI-