Monarch geneset OGS2.0

DPOGS211010
Transcript        DPOGS211010-TA    1761 bp
Protein           DPOGS211010-PA    586 aa
Genomic position  DPSCF300004 + 1176562-1187207
RNAseq coverage   385x (Rank: top 31%)
Annotation
Heliconius      HMEL008091        9e-120  72.81%
Bombyx          BGIBMGA006492-TA  3e-67   96.75%
Drosophila      fru-PN            1e-63   87.90%
EBI UniRef50    UniRef50_Q8IN81   2e-61   83.58%  Sex determination protein fruitless n=9 Tax=Drosophila RepID=FRU_DROME
NCBI RefSeq     NP_001157690.1    5e-77   39.75%  fruitless [Tribolium castaneum]
NCBI nr blastp  gi|255958217      1e-75   39.75%  fruitless [Tribolium castaneum]
NCBI nr blastx  gi|255958217      5e-78   39.96%  fruitless [Tribolium castaneum]
Group
Gene Ontology    GO:0005515  1.1e-24  protein binding
                 GO:0003676  1.1e-05  nucleic acid binding
KEGG pathway
InterPro domain  [87-200]   IPR011333  4.7e-30  BTB/POZ fold
                 [107-200]  IPR013069  1.1e-24  BTB/POZ
                 [115-210]  IPR000210  1.2e-22  BTB/POZ-like
Orthology group  MCL30434  Lepidoptera specific

Nucleotide sequence:

>DPOGS211010-TA
ATGAAGTTCGCGTTGCGAATACTGGCCGATTATGATGTGAAAGATATTCCGAAGGCCGTGAGACGGCTAGAGGTTTTCGCGTCCCGTAGGGTCGGTGGCCCCGTCGTCAGCATTCTGGAATTAAATTACATAATGTATGGGTCTAGTTTGGAGCATGCAACAAGTAGATATGTGTCGGCGCTCGGCGGTCTAACTGTCAATGTGGAGTACACTTTTGATCATTTTTCAGCACGATCACTGACACTAGCACCAACAATGGACCAGCAATTTTGTTTGCGCTGGAACAATCATCCAACCAACCTGACAGATGTGCTTGCAAGCCTATTACAGAGAGAGGCACTATGTGATGTTACACTAGCATGCGATGGGGAAACAGTCAAGGCACACCAGACAATACTATCAGCGTGTTCCCCGTATTTTGAAAGTATATTCTTACAAAATTCACACCCGCATCCCATTATATTCCTTAAAGATGTGAGGTTCTCAGAGATGAAATCTCTGTTAGATTTTATGTATAAGGGAGAGGTGAATGTTGGCCAAAATATGCTACCAATGTTCCTAAAGACTGCCGAAAGTTTACAAGTTAGAGGTTTGACAGAGAATAATACGTTGAATACTAAGTCAGAGGAGCGGTCGACTCCCAGCGTGAGTGCTGAGAATTTATCCCGCGGTGAGTTCGCCACACCGCCTGCTGCTCATGCGCTCGCAGCTCTAACACCGCTGCCGCAGTCACAGTCACTGCCGCAGTCGCTGCCGCAATCGCTGCCGCAGCCGCTGCCGTTGCCGTTGCCGCCGCACGCGCCGCTCGAGAAGCGACGCAGGAAGAACTCCACCGCGCCAAGGGACGATATCGATCTGTCCTACCGACATTATGAGGGGCACGTGAAGGCTAGCAAAGGTTCAACCGGCTCTGGTTCCGAGCCGTCGACTCCTCCACCAGCTCACGGCCGCGCCGCTCGCTCCCCAGCATTGCTCGTTAAACAAGAGCCAGACTACACGCAACACCACTCCTACGACCAGACTCACCTCACGATGGAGGAGCCAGATGATCAAGACGTGTCTGCGAACCGAGCTCAAACACGGCGAAATGGAATGGGAGTGAATGATATGGCATCAATGATAACCCAGCACTCGATGAACAACGATTGCAACGAGAGCGAACCCGTAATGCCGCCTCACCCCGACCAGACGGACACCATTGACGGTGATAAATATGCGGACGAGAATGATATACCTCAAGAGCATTTCGGCCAAAATATAACAAATATTGAGAATATAGTAAAATCTTTTAGGATAGCATTAAATCATAGATCACATAGCCCAATGACCTGTCAGATATGTGGTAAAACTGTAAGCAATATCAAGAAGCACATGAAATCACACAATCCAGAACAACACAAATGCCCTCTCTGCTCGAAGGGCTGGCACATGAGGCTGACGTTCGAGCGTGTGGCGGGCGCCCTCAACCTGCACCGCTGCAAGCTGTGCGGGAAGGTGGTCACTCACATCAGGAACCACTATCACGTGCACTTCCCTGGACGGTTCGAGTGCCCGCTATGCCGAGCCACCTACACGCGCTCGGACAACCTGCGCACGCACTGCAAGTTCAAGCATCCGGCTTACAACCCCGACACGCGCAAGTTCGAGGGCGCGCCGGTGGCCGTGGGCGCGGGCGTGGGCGTGGGTGGCGCTCACGGGCCTCACGCAGCTCATGCGCCGCCGCCGTTGTTCGCGAACCACCTGGACGCGGGCTTCGACTGA

Protein sequence:

>DPOGS211010-PA
MKFALRILADYDVKDIPKAVRRLEVFASRRVGGPVVSILELNYIMYGSSLEHATSRYVSALGGLTVNVEYTFDHFSARSLTLAPTMDQQFCLRWNNHPTNLTDVLASLLQREALCDVTLACDGETVKAHQTILSACSPYFESIFLQNSHPHPIIFLKDVRFSEMKSLLDFMYKGEVNVGQNMLPMFLKTAESLQVRGLTENNTLNTKSEERSTPSVSAENLSRGEFATPPAAHALAALTPLPQSQSLPQSLPQSLPQPLPLPLPPHAPLEKRRRKNSTAPRDDIDLSYRHYEGHVKASKGSTGSGSEPSTPPPAHGRAARSPALLVKQEPDYTQHHSYDQTHLTMEEPDDQDVSANRAQTRRNGMGVNDMASMITQHSMNNDCNESEPVMPPHPDQTDTIDGDKYADENDIPQEHFGQNITNIENIVKSFRIALNHRSHSPMTCQICGKTVSNIKKHMKSHNPEQHKCPLCSKGWHMRLTFERVAGALNLHRCKLCGKVVTHIRNHYHVHFPGRFECPLCRATYTRSDNLRTHCKFKHPAYNPDTRKFEGAPVAVGAGVGVGGAHGPHAAHAPPPLFANHLDAGFD-
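
The lengths and coordinates listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database. It assumes the transcript is a complete coding sequence ending in a single stop codon (the nucleotide sequence above starts with ATG and ends with TGA) and that the genomic coordinates are 1-based and inclusive, which the record does not state. The function names are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1761
PROTEIN_AA = 586
GENOMIC_START, GENOMIC_END = 1176562, 1187207


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


def genomic_span(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Does the transcript length agree with the reported protein length?
protein_ok = expected_protein_length(TRANSCRIPT_BP) == PROTEIN_AA

# Size of the locus on the scaffold, and the part not covered by the transcript
# (presumably intronic, under the assumptions stated above).
span_bp = genomic_span(GENOMIC_START, GENOMIC_END)
non_transcript_bp = span_bp - TRANSCRIPT_BP
```

Under these assumptions the 1761 bp transcript corresponds to 587 codons, one of which is the stop codon, which matches the 586 aa protein.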