Monarch geneset OGS2.0

DPOGS206861
TranscriptDPOGS206861-TA1002 bp
ProteinDPOGS206861-PA333 aa
Genomic positionDPSCF300001 - 2790450-2802195
RNAseq coverage428x (Rank: top 29%)
Annotation
HeliconiusHMEL0061511e-10493.85% 
BombyxBGIBMGA013823-TA9e-4646.63% 
DrosophilaSxl-PH7e-7768.02% 
EBI UniRef50UniRef50_Q3HM268e-15480.42%Sex-lethal isoform 1 n=5 Tax=Obtectomera RepID=Q3HM26_BOMMO
NCBI RefSeqNP_001036780.12e-15480.42%sex-lethal isoform L [Bombyx mori]
NCBI nr blastpgi|898856551e-15380.42%sex-lethal [Bombyx mori]
NCBI nr blastxgi|898856553e-15780.42%sex-lethal [Bombyx mori]
Group
Gene OntologyGO:00037233.2e-40RNA binding
GO:00001663.2e-28nucleotide binding
GO:00036763e-23nucleic acid binding
KEGG pathway 
InterPro domain[61-76] IPR0023433.2e-40Paraneoplastic encephalomyelitis antigen
[54-139] IPR0126773.2e-28Nucleotide-binding, alpha-beta plait
[62-135] IPR0005043e-23RNA recognition motif domain
Orthology groupMCL16301 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206861-TA
ATGTGCGAGCGGTACCCAGGATATCTCTTCGGAGGGACTCCGGTCAGCAGCTGGGACAGAAGTTTTAATTCACTAGAAAATCAAATCCAGCAGCAGTATCGAGGGCGTGTAGGTGCGGGTGTCCGTTGCTGGGGCAGTATGTCAGAGGGCGGGATATCCTCTGCGGGTGACGCGGGTAAGACCAACCTGATCGTCAACTACCTGCCGCAGACTATGACGGAGAAGGATCTCTATGCGATGTTCATGTCCATAGGACCCATAGAGAGCTGCAGGGTTATGAAGGATTTTAAGACTGGTTATAGCTATGGGTTTGGTTTCGTCAACTTCACAAGAGAGGAAGACGCGGCACGAGCGATCGAAACATTCAATGGATATCAATTGAGGAATAAGAGACTGAAGGTGTCCTACGCTCGTCCTTCCGGTGAGGATATAAAAGAGACCAACTTGTACGTAACGAATCTGCCGCGTGCTATAACCGAGGACCAGTTGGAGACGATATTCGGGAAGTACGGCAGAATTGTACAGAAACATATACTGAGAGACAAGAGCAACGGCACTCCGAGAGGTGTCGCTTTTGTGCGGTTCGACAAACGTGAGGAGGCTCAGGAGGCTATCGCGGCGCTAAACAACGTTATTCCCGAGGGTGGTTCCGAGCCGCTGTGTGTTAAGGTTGCTGAAGAACACGGGAAGCAGAAGGCCGCGTACTACGCCGGTTGGGCCGCTGGCTTTCATCACAACAGGGGGGACTTTTCCTGGAACTGTGGGAACTGCATGATGCCGGATGCCTGGGAAAGGTTCCCGTCCCCACCATTGGGAGGCAGTCATCCAAATACTCCCAGAAACGGATGTAGGCCGGGATCCTGCGCCAACACTAGGATATTCCCCAACTACCCAAAAAACTCACCCAGGAACAATAGGGGACGAAACTTCCCCTGGGGTCAGGGTGTTGAAGATAGGTTTAAGGGACAGCGGTGTAGAAACGGTCAAGCGCCTTATTGGTAA

Protein sequence:

>DPOGS206861-PA
MCERYPGYLFGGTPVSSWDRSFNSLENQIQQQYRGRVGAGVRCWGSMSEGGISSAGDAGKTNLIVNYLPQTMTEKDLYAMFMSIGPIESCRVMKDFKTGYSYGFGFVNFTREEDAARAIETFNGYQLRNKRLKVSYARPSGEDIKETNLYVTNLPRAITEDQLETIFGKYGRIVQKHILRDKSNGTPRGVAFVRFDKREEAQEAIAALNNVIPEGGSEPLCVKVAEEHGKQKAAYYAGWAAGFHHNRGDFSWNCGNCMMPDAWERFPSPPLGGSHPNTPRNGCRPGSCANTRIFPNYPKNSPRNNRGRNFPWGQGVEDRFKGQRCRNGQAPYW-