Monarch geneset OGS2.0

DPOGS214056
TranscriptDPOGS214056-TA2157 bp
ProteinDPOGS214056-PA718 aa
Genomic positionDPSCF300171 - 119903-225761
RNAseq coverage96x (Rank: top 62%)
Annotation
HeliconiusHMEL0146532e-8394.97% 
BombyxBGIBMGA010389-TA5e-7699.24% 
Drosophiladac-PB5e-9568.10% 
EBI UniRef50UniRef50_B4JZ695e-9771.48%GH25029 n=9 Tax=Endopterygota RepID=B4JZ69_DROGR
NCBI RefSeqXP_002427868.19e-10775.49%dachshund, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3504032444e-10751.48%PREDICTED: hypothetical protein LOC100741294 [Bombus impatiens]
NCBI nr blastxgi|1571310854e-16848.08%dachshund, putative [Aedes aegypti]
Group
Gene OntologyGO:00056341.4e-47nucleus
GO:00001661.5e-40nucleotide binding
KEGG pathway 
InterPro domain[18-130] IPR0033801.4e-47Transforming protein Ski
[24-131] IPR0090611.5e-40DNA binding domain, putative
Orthology groupMCL14750 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214056-TA
ATGATGCATCACTCGCCGCTGGAGCTGATGGCAGCCGCTCACCATCACGGCCCACCGCGCTATGGCAGCCCGCCGCCTATATCCACATCCGACCCATCCGCTAATGAGTGCAAGCTGGTGGATTACAGAGGACAGAAGGTCGCAGCCTTCATCATCCAGGGCGACACCATGCTCTGCTTGCCGCAGGCTTTCGAGCTGTTCTTGAAGCATCTAGTGGGCGGGCTGCACACAGTGTACACGAAGCTGAAGAGACTGGACATAGTGCCGCTGGTGTGCAATGTAGAGCAGGTTCGCATCCTGAGAGGTCTGGGCGCGATACAGCCTGGGGTCAACCGCTGCAAGCTGCTCTCCTGCAAGGACTTCGACGTGCTCTACCGCGACTGCACCACGGCAAGCTCTAGACCGGGGCGACCTCCGAAGCGTGCCTCGGGTGTGGGCCTGTCGCTAGCCGCCACCCAATTCCCTGGACATCCCTTCAAGAAGCATCGCCTTGAGAACGGAGACTACTCCCCTTATGAAAACGGACATATGAGTGAAATGGCTCGTATGGACAAGTCCCCCCTGCTCGCTAATGGGTACAACGCGCCCCCCACGCACCTGGGACCCATGGGCTTCATGCATCAGCACGCGCTCATGTCCCCCAGTATGCACCACGGAGTCCCTCGACCTGACGGATCCATCATCAAAGGCCAGCCGATGCACAATATGGAGGCACTGGCGAGACAGAGTACGTTTCTTTACGGTGGGTGCAATAAAATAAAACAGTTGTCTCCGTACGTGGCAGAATTACCAAAACTTATAGAAATGGCTCGTATGGACAAGTCCCCCCTGCTCGCTAATGGGTACAACGCGCCCCCCACGCACCTGGGACCCATGGGCTTCATGCATCAGCACGCGCTCATGTCCCCCAGTATGCACCACGGAGTCCCTCGACCTGACGGATCCATCATCAAAGGCCAGCCGATGCACAATATGGAGGCACTGGCGAGATCTGGTATTTGGGAGAATTGTAGAGCAGCTTACGAAGACATCGTAAAACATCTAGAAAGGTTGCGGGATGAGAGAGGTGACATCGAACGTGTGATAGCGATGGACAAAGCACGCGAGGGTTCACATAACGGCTCCTCTCCAGGTCACAGTCCTGTCTTGAACCTCTCGAAATCAGGTTCTGGAGAACGGGAACGAGAACGAGAGCGCGAACGTGGGGAAAGGGGGGAGGGTTCTGCCAGTGGTCGAAGCTCAGCAGCTTCTCGCCGCACACCCCAACCTCCCCGCATACCATCCACAGCAGCAGCTGCTTCACCCAGATCTCATACTGACGATAGTGATCCCGGTCTCTCTGACCAAGAAGACCATAATGTTAAAGACGAAGATGACGGCGCTGAGTTGAGTGACGGTGAGCGTGATATGCCAACGAATACATCACCAGCTGCTGTCAACTACCCGACTCAAGGATCTCCCTCAAACGTGCCCGTGGACCCGTCCGCAGACACCCTGGTCTCCTCAACGGAAACGCTCCTGAGGAACATTCAGGGGCTGCTCAAGGTTGCAGCAGACAACGCACGCCAGCAAGAACGACAGATTAGCTACGAAAAAGCCGAACTGAAGATGGACGTCCTAAGAGAACGAGAAGTGAAAGACAATTTGGAAAGACAACTACTCGATGAACAGAAGATGAGAGTTATGTATCAGAAGAGACTAAAAAAGGAAAGAAAACAACGACAACAAATACAAGAACAGTTAGAAATGGAACTTAAGAGACGACAGAAGATTGAAGAAGCATTGAAGCAATCGGGGGCGCCTTCTGAGATACTCAGAATAGTCACTGAGAATCTATCACCGCCCCCAGAGAATAGGGAACGCGAGAATGGTACGGAGAGCAAGCCACCGAGCACGGAGCCGCCCACCACATCACCACCTTTCCAGCGTGACCCACCACGCACACCAGACAAACCACAATGGAACTACCCTCCACCACCCGTTGATATCATGAGTGGAGGAGCAGCCTTCTGGCAAAATTACTCCGAATCCCTGGCGCAGGAGTTGGAGATGGAGCGCAAATCTCGTCAGCAGGCGATGGAGCGTGATGTCAAGAGTCCGTTGTCAGACCGCGCTGGTTACTACAAGAACTCAGTGTTGTTTAGCTCAGCCACTTAG

Protein sequence:

>DPOGS214056-PA
MMHHSPLELMAAAHHHGPPRYGSPPPISTSDPSANECKLVDYRGQKVAAFIIQGDTMLCLPQAFELFLKHLVGGLHTVYTKLKRLDIVPLVCNVEQVRILRGLGAIQPGVNRCKLLSCKDFDVLYRDCTTASSRPGRPPKRASGVGLSLAATQFPGHPFKKHRLENGDYSPYENGHMSEMARMDKSPLLANGYNAPPTHLGPMGFMHQHALMSPSMHHGVPRPDGSIIKGQPMHNMEALARQSTFLYGGCNKIKQLSPYVAELPKLIEMARMDKSPLLANGYNAPPTHLGPMGFMHQHALMSPSMHHGVPRPDGSIIKGQPMHNMEALARSGIWENCRAAYEDIVKHLERLRDERGDIERVIAMDKAREGSHNGSSPGHSPVLNLSKSGSGERERERERERGERGEGSASGRSSAASRRTPQPPRIPSTAAAASPRSHTDDSDPGLSDQEDHNVKDEDDGAELSDGERDMPTNTSPAAVNYPTQGSPSNVPVDPSADTLVSSTETLLRNIQGLLKVAADNARQQERQISYEKAELKMDVLREREVKDNLERQLLDEQKMRVMYQKRLKKERKQRQQIQEQLEMELKRRQKIEEALKQSGAPSEILRIVTENLSPPPENRERENGTESKPPSTEPPTTSPPFQRDPPRTPDKPQWNYPPPPVDIMSGGAAFWQNYSESLAQELEMERKSRQQAMERDVKSPLSDRAGYYKNSVLFSSAT-