Monarch geneset OGS2.0

DPOGS212853
TranscriptDPOGS212853-TA1530 bp
ProteinDPOGS212853-PA509 aa
Genomic positionDPSCF300086 + 247793-264399
RNAseq coverage3392x (Rank: top 4%)
Annotation
HeliconiusHMEL0101213e-17193.37% 
BombyxBGIBMGA000798-TA4e-6988.74% 
Drosophilaps-PL5e-14554.95% 
EBI UniRef50UniRef50_Q9GQN34e-14353.54%PASILLA splice variant 4 n=21 Tax=Neoptera RepID=Q9GQN3_DROME
NCBI RefSeqXP_973573.22e-14958.40%PREDICTED: similar to pasilla CG16765-PK [Tribolium castaneum]
NCBI nr blastpgi|1892393354e-14858.40%PREDICTED: similar to pasilla CG16765-PK [Tribolium castaneum]
NCBI nr blastxgi|1951078553e-14456.18%GI24011 [Drosophila mojavensis]
Group
Gene OntologyGO:00037231.8e-13RNA binding
KEGG pathway 
InterPro domain[122-185] IPR0181111.8e-13K Homology, type 1, subgroup
[24-97] IPR0040875e-13K Homology
Orthology groupMCL11731 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212853-TA
ATGCAGTTTTTGAAGCAGAAACCAAATGACGAGCTGCACGTTGAAATGTTCAGATTTATGACGGTGGAGCCGACGTACCACTTCAAGGTGCTGGTGCCGTCGATGGTGGCCGGCGCCATCATCGGCAAGGGTGGTGAGACCATAGCGCAACTGCAGAAGGACACGGGGGCCAGGGTCAAGATGTCCAAATCGCATGATTTCTATCCAGGTACTACAGAACGAGCGTGTCTCATAACGGGGTCGGTGGAAGGCATCATGGTGGTGCTAGACTTCATCATGGAAAAGATCAAAGAGAAACCGGAGCTGGTGAAACCCTTCCCGGAGGGCGTGGATGCCAAGATGCCGCAGGATAGAGACAAGCAGGTGAAGATCTTGGTGCCGAACTCCACAGCTGGTATGATAATAGGAAAGGGGGGCAACTACATTAAACAAATCAAGGAACAGAGCGGCAGCTACGTACAGATATCTCAGAAGGCGAAGGAACTGTCTCTCCAGGAGCGCTGCATCACTGTTGTCGGTGAGAAGGAGAACAACAAGAAGGCCTGCCTGATGATCCTTCAGAAGGTGGTGGACGACCCTCAGTCCGGGTCCTGTCCCAACGTGTCGTACGCGGACGTGGCCGGGCCGGTCGCCAACTACAACCCCACCGGCTCGCCGTACGCCGTGCCCACCACTGAGGTGACAGAGAGTCACGCTCTGGTGGGTGGCGGGTCTGTGGGCGGCGTGGGCGGTGCTGGCGCGCTAGGCGGCGTGGGCGGTGTGGGCGGCGTGCTCGTGAACGGTTCCGGTCTGGGCTCGCTGTCCCTGTCTCTGTCGCTGGCGCCGCCCGGCACCCCGCCGCCGTCCCCGCTCACACAGCACACGCTCGACCACATTAAGGCGGCGCTGCGTCAGGCGGGCTACTCGGAGGCCGGGCTGAGCGAGATCGGCGCGGCGCTGGCTCTGCTGGTGAAGCACGGCGTGCTGGGCCTGGCGCTGCCGGCCGCCCTGCCCGCGCCGCTGTCCGCCGCCTACTTCCCCCTGCAGCCCAGCGACTCGCCCGCTGTCTTCGGACCGCTGGCGCAGGTCCAGCTCGGCGGCGCGCGCGGCGGCTCGTTAGAGCGTTTCGCGGAGGTTGCATTCGAGGCGCTTCGCCCCCCAGCCGTGGCTCCCATCTCGCTGTCGGGCGGCGTGGGGGGAGTGGGGGGCGTACCCGGCTTCCCCTCCGCCAGCCTGCTGCCGCTCTCCAAGAGCCCCACGCCCGCCGACGCGGGCGCCAAGGACTCTAAGAACGTCGAGATCCCGGAGGTCATCGTCGGGGCCATCCTGGGCCCCGGCGGCCGCAGCCTGGTGGAGATCCAGCAGATGTCGGGTGCCAACATCCAGATTTCCAAAAAGGGCACGTTCGCCCCGGGCACCCGCAACCGCATCGTGACCATCTCGGGCACCGCCACCGCCATCAGCAATGCGCATTACCTCATCGAACAGAAGATCCAGGAGGAGGAGCTCAAGCGCACGCGCCACAACGCGCTCTCCGGCCTCATGCAGTAG

Protein sequence:

>DPOGS212853-PA
MQFLKQKPNDELHVEMFRFMTVEPTYHFKVLVPSMVAGAIIGKGGETIAQLQKDTGARVKMSKSHDFYPGTTERACLITGSVEGIMVVLDFIMEKIKEKPELVKPFPEGVDAKMPQDRDKQVKILVPNSTAGMIIGKGGNYIKQIKEQSGSYVQISQKAKELSLQERCITVVGEKENNKKACLMILQKVVDDPQSGSCPNVSYADVAGPVANYNPTGSPYAVPTTEVTESHALVGGGSVGGVGGAGALGGVGGVGGVLVNGSGLGSLSLSLSLAPPGTPPPSPLTQHTLDHIKAALRQAGYSEAGLSEIGAALALLVKHGVLGLALPAALPAPLSAAYFPLQPSDSPAVFGPLAQVQLGGARGGSLERFAEVAFEALRPPAVAPISLSGGVGGVGGVPGFPSASLLPLSKSPTPADAGAKDSKNVEIPEVIVGAILGPGGRSLVEIQQMSGANIQISKKGTFAPGTRNRIVTISGTATAISNAHYLIEQKIQEEELKRTRHNALSGLMQ-