Monarch geneset OGS2.0

DPOGS211131
Transcript         DPOGS211131-TA   1338 bp
Protein            DPOGS211131-PA   445 aa
Genomic position   DPSCF300007 - 311979-313541
RNAseq coverage    1203x (Rank: top 10%)
Annotation
Heliconius       HMEL017227               4e-147   89.26%
Bombyx           BGIBMGA003006-TA         1e-137   78.67%
Drosophila       CG10555-PA               1e-19    55.56%
EBI UniRef50     UniRef50_UPI000224778B   4e-31    47.44%   UPI000224778B related cluster n=1 Tax=unknown RepID=UPI000224778B
NCBI RefSeq      XP_624990.2              1e-30    45.65%   PREDICTED: similar to SSXT protein (Synovial sarcoma, translocated to X chromosome) (SYT protein) [Apis mellifera]
NCBI nr blastp   gi|383856984             1e-30    47.19%   PREDICTED: uncharacterized protein LOC100875159 [Megachile rotundata]
NCBI nr blastx   gi|383856984             3e-71    39.71%   PREDICTED: uncharacterized protein LOC100875159 [Megachile rotundata]
Group
KEGG pathway 
InterPro domain   [11-73]   IPR007726   1.8e-29   SSXT
Orthology group   MCL26882   Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211131-TA
ATGTCTGTTGCGTTCGCACCCCGGGGCAACCGGCCACCATTGAGTCCAGCACAAATTCAAAAAATGCTGGATGAAAATGCACATTTGATACAAACAATTCAAGAATATCAAGCGAAAGGGCAGCTTATGGAATGTCATCAATATCAGCAAGTTTTACACAGAAATTTAGTGTATTTAGCATCTGTCGCAGACGCCAATCAGAACATTCAAGCTTTACTCCCGCCACCTCACCAATTAGCTGGTAATGTTCAGCAGGGATCCCTCAACCCTCCTACTAGCGGAGCTGATGTACCAGGATCACCACAACAACCATACCGTCCACCATCAGGAGTGTCTTCAACACCTACAAGACCGACACAATCATATGGTCAAAGACCTTACCCACAGAATCAATATCAAGGACAGTATCAAGGCTCACCAGGCTATCCTCCCCAACCTGGTTATGGACCCCCTGGGCAAGGATATGGTCCACCCAACCCCTCTCAACAACCACAAGGCTATCCACCCAATTCCACATATGGACCTCCTATAACCACATCAAGTAATTATCCCCCCTCTACACACCCAAGTCCAGGATACCCACCCTCTTCTGGTCAGCAACCTTATGCTCCACCTCCTGGAAGCCCCGCAGCTGCAGGTTCACCATATCCAGTCAGAGGATCAAGCCAACCAGCATACACTGGAAATTCAGGTTACCCTCCACCTCAATCGGGACCGAACTATCCCAATGTGGGTGTGAGTTCTTACAATGCTACAACCTCCCAGCCCCAGCCGTATCAATCACAACCATTCCCTAATACTACTCCTACGTCAGCATATAGCTCAACACCAATCTCGCAACCGAACCGCTCTCCTCAACCACCACCATCTGGTTATACATCTCAAAATCCGACCAGCACTGGTTATGGTTCTCCATCTGCACAGTCACCGACATATAACTCTAGTGCCCATGGAAATAATGGTCAAAGTTCAACACCATCCTCAGGAGGACCACCTCCAAGTGGTACCCCATCTGGTTCACAGTATCCTCCATCTAGTCAACCTCCATATCCTCCAGCAACACAACCCCCTTATTCTAATCCATCATCTCAGCCTGGTAGTCCATCACCATCAGTCTCAACAGCACCACCTCCTCAGTCTTCATACCCACAGAATCCTCAAAATTATCCTCCTGGAGGAGGCGCATATCCTCCCCACGCCTATCAACAAGGCTATCCTCCGGCCCAGTATCCCCCGTCTCCCTATCCATATGCTCGGGCCCCAGCACCCGGTGCTCCTCCTCCAGGTGCGCCACAACCCTATCCAGGTTATGGATTCCAGCCCCCTAATCAGCAGTAA

Protein sequence:

>DPOGS211131-PA
MSVAFAPRGNRPPLSPAQIQKMLDENAHLIQTIQEYQAKGQLMECHQYQQVLHRNLVYLASVADANQNIQALLPPPHQLAGNVQQGSLNPPTSGADVPGSPQQPYRPPSGVSSTPTRPTQSYGQRPYPQNQYQGQYQGSPGYPPQPGYGPPGQGYGPPNPSQQPQGYPPNSTYGPPITTSSNYPPSTHPSPGYPPSSGQQPYAPPPGSPAAAGSPYPVRGSSQPAYTGNSGYPPPQSGPNYPNVGVSSYNATTSQPQPYQSQPFPNTTPTSAYSSTPISQPNRSPQPPPSGYTSQNPTSTGYGSPSAQSPTYNSSAHGNNGQSSTPSSGGPPPSGTPSGSQYPPSSQPPYPPATQPPYSNPSSQPGSPSPSVSTAPPPQSSYPQNPQNYPPGGGAYPPHAYQQGYPPAQYPPSPYPYARAPAPGAPPPGAPQPYPGYGFQPPNQQ-
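
The transcript and protein records above are related by simple arithmetic: 1338 bp is 446 codons, that is, 445 amino acids plus one stop codon, which this record writes as a trailing "-". The sketch below shows one way to check that relationship by translating a coding sequence with the standard genetic code. It is an illustration only and assumes the standard code applies and that the sequence is an in-frame CDS; the function name translate and the stop_symbol parameter are hypothetical and not part of the geneset's tooling. Only the first and last few codons of DPOGS211131-TA are used here to keep the example short.

```python
# Minimal sketch: translate an in-frame coding sequence with the standard
# genetic code (NCBI translation table 1). Names here are illustrative.
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a CDS codon by codon; stops are written as stop_symbol."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for ambiguous codons
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First ten codons and last five codons of DPOGS211131-TA.
head = translate("ATGTCTGTTGCGTTCGCACCCCGGGGCAAC")
tail = translate("CCTAATCAGCAGTAA")
print(head, tail)

# Length check: 1338 bp -> 446 codons -> 445 residues plus the stop.
print(1338 // 3 - 1)
```

Running the same function on the full nucleotide record should reproduce the protein record, including the final "-", if the assumptions above hold.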