Monarch geneset OGS2.0

DPOGS216155
TranscriptDPOGS216155-TA1386 bp
ProteinDPOGS216155-PA461 aa
Genomic positionDPSCF300155 - 401447-405948
RNAseq coverage227x (Rank: top 44%)
Annotation
Heliconius% 
BombyxBGIBMGA014172-TA9e-3839.42% 
Drosophila% 
EBI UniRef50UniRef50_UPI00022CA66D3e-1026.34%UPI00022CA66D related cluster n=1 Tax=unknown RepID=UPI00022CA66D
NCBI RefSeqXP_001689111.12e-0829.19%AGAP010319-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3504249841e-0926.34%PREDICTED: snRNA-activating protein complex subunit 1-like [Bombus impatiens]
NCBI nr blastxgi|3504249846e-1025.34%PREDICTED: snRNA-activating protein complex subunit 1-like [Bombus impatiens]
Group
KEGG pathway 
InterPro domain[12-87] IPR0191886.4e-12Small nuclear RNA activating complex (SNAPc), subunit SNAP43
Orthology groupMCL19025 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216155-TA
ATGTCACGCTACAGTCACCACATATATAAAGTTTATATAGCAGATGGATTTGCGACTGATTGTGATGAACTGATTCATCGCTGCCTGCAGCTGTCCAAGTTGGATTATCAAGAGTTCTGCAAAATTTGGAAGGACCTGGATTTTAGCATGATTTTTCACGGTAGAAATTCAGGTGCAGAGATAGCAGAACTATCGGAAGAACTTGTGTATATAAGTAAACAATATCTGCTGAAAAATACTGAAAATTTTGAGCCGTATACTAACTTTGCATGGCTTCGTCTCACTCCAGATGATATACCGGCCATTAAAAGAATAGAGTTAGTTGCAAGACAGGACAGGAGACTGGATCTGCTGTATATTCTTGGGGAGATACTTATCAAGTACACTCAATATCATGCCGTTGAGAGGGAGAGGGGTATGGAATCTGTGTTGAGGAAATATCTGGATGGTTACACTAGTATAGACAAACTGGGTGTCCGACCTAAAGGTGTGTTCTACCGACAAAACGAAGAGCTGGATATTATAAGAGATTTGGGAACCGTCTCAAGACAGTACACAAAAGCTAAGGACATGCTCAGAGTTGCTGGTCGGCCGGATCCCAGCCTGCAGTACATCAATGAAAACTTGCCGTTTGAGTTGAATGTTACCCTGAAGAAGATTATCAATGGAGCTATAGACGATGATTGTGACGATTCACCCGACGAGCATTATAACATGGTGCAAGCTATTAAGACCAGGGCGGTGACGAATACTGTCGACAACATGAGACATCTCACAGCAGTCGAAGACAGGATATCTACGCAGGAGAATTCAAGTCCAAGGACAAGTAAACAGGGGAAGAGTGATTTGAAAACTCCAACCGGTAGAGTCAAATCTGGTAGTCCAACCAAAAGAATGACCAGCTCGCCAGGAAAAAGGAAGTTGGATGTGAAATCGGTAAGAAAAAAGAAGATCTGTGTGGAAAATACAAAATGGGATTCAGGAAGCGCTGAAATAGATGTGGAGGATTTTCATAAGGCAGCAGCGAAGGTCATAGCTGAAGGTGCTGAGGGTGGAATCACACACACAAAGGAAACGGATATACATATAGATCCTACGGCTGTGGTTCTAGGGAAGGATGTAGCGAAGAACTTGGAGATCGAGTTCATTGATGTAGGGAATCTATCAGTAGACGGTAATGGGGACAACGAATCTGGAAGAGATACTGATTCCACAAGCGACACAAACAGAACAGCTATCAAAACTAAAATAAAGAAACCAGAGAAGAGAGAGCTCAAGAGACTGCATTTGAAATCTAAATTCAAACGTTTAGGAATGTTACCAGTCGCTAACTTTAATGATAAAAATGGCGACCTTGACAATAATACGGACCAAATGGGAAATTAA

Protein sequence:

>DPOGS216155-PA
MSRYSHHIYKVYIADGFATDCDELIHRCLQLSKLDYQEFCKIWKDLDFSMIFHGRNSGAEIAELSEELVYISKQYLLKNTENFEPYTNFAWLRLTPDDIPAIKRIELVARQDRRLDLLYILGEILIKYTQYHAVERERGMESVLRKYLDGYTSIDKLGVRPKGVFYRQNEELDIIRDLGTVSRQYTKAKDMLRVAGRPDPSLQYINENLPFELNVTLKKIINGAIDDDCDDSPDEHYNMVQAIKTRAVTNTVDNMRHLTAVEDRISTQENSSPRTSKQGKSDLKTPTGRVKSGSPTKRMTSSPGKRKLDVKSVRKKKICVENTKWDSGSAEIDVEDFHKAAAKVIAEGAEGGITHTKETDIHIDPTAVVLGKDVAKNLEIEFIDVGNLSVDGNGDNESGRDTDSTSDTNRTAIKTKIKKPEKRELKRLHLKSKFKRLGMLPVANFNDKNGDLDNNTDQMGN-