Monarch geneset OGS2.0

DPOGS204256
Transcript: DPOGS204256-TA, 1653 bp
Protein: DPOGS204256-PA, 550 aa
Genomic position: DPSCF300046 - 337688-340128
RNAseq coverage: 663x (Rank: top 19%)
Annotation
Heliconius: HMEL015189  0.0  92.38%
Bombyx: BGIBMGA007530-TA  0.0  88.60%
Drosophila: ttk-PA  3e-75  69.81%
EBI UniRef50: UniRef50_D6WNS9  3e-113  45.93%  Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WNS9_TRICA
NCBI RefSeq: NP_001157610.1  5e-86  55.69%  tramtrack [Tribolium castaneum]
NCBI nr blastp: gi|270007285  1e-112  45.93%  hypothetical protein TcasGA2_TC013842 [Tribolium castaneum]
NCBI nr blastx: gi|156548500  3e-111  45.67%  PREDICTED: hypothetical protein LOC100122117 [Nasonia vitripennis]
Group
Gene Ontology: GO:0005515  3.8e-27  protein binding
               GO:0003676  1.1e-20  nucleic acid binding
KEGG pathway:
InterPro domain: [3-121]  IPR011333  1.2e-32  BTB/POZ fold
                 [21-122]  IPR013069  3.8e-27  BTB/POZ
                 [31-126]  IPR000210  1.9e-26  BTB/POZ-like
                 [472-507]  IPR013087  1.1e-20  Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL15854  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204256-TA
ATGGCTACTCAAAGATTTTGTTTGCGTTGGAACAATCACCAGACTAATATGTTGTCGGTGTTTGATCAGCTCCTTCACGCTGAGACCTTCACTGACGTCACGTTGGCTGTTGAAGGCCAATTGCTTAAAGCTCATAAAATGGTTTTATCGGCTTGCAGTCCGTACTTTCAAGCCCTATTTGTGAATCACCAAGAGAAACATCCAATCGTTATACTTAAAGACGTACCTTACTCGGACATGAAAAGCTTGTTAGATTTCATGTACAGGGGTGAAGTGAGCGTCGACCAGGAACGTCTTACGGCGTTTTTGAAAGTAGCTGAGAGTTTAAGAATCAAGGGGCTGACGGAGGTCAATGAGGAAAAATGCGACATACCCGCCCTGACAAACTCTTTGATCCAACAGCAACAGGGCAACACAGCTCACACTCCGCCACCGCAATTACACAGAATACACCCGTATATGCATCAAAAGAGGCCTGCATCTGGCATCCCCAGCGGAGGAGCTCCTCCTAATTTACTGATGCCTCTATTGGGCAATGCGTTGATGCAACCGAAGCGAAAACGAGGAAGACCCAGGAAACTTAGTGGGAGTTCCAGCGACGCCCTCAACACCGCCGCAAGCCCTCCGGGAGAATTCATCCCAGAGTCAGCATCGCATGCGTCATCAAACAGAGCCGGGGACCAACTGATTCGTGGCTCGCCAGAAATGTTAGAGGTCAAAATGTCCATGGATAGTTTTAACGCGGAAGACGGAGTTACTTCTGGAGGCGAGGATGCTGGGGAAGCGCTTATGATAGACGAAGGTGACGACGCACAGTCGAACGAGGCTCCCCAATCCGGCAAAGATTCAGAATCAGCAGATAATGATAAACAGCCCAAAGAGGAAACACATCAAAATTTTCCATCAAACGGTCCGATATTGTCAATCGAGAATGGAGCCATTAAACAGGAACCAGCTTGTGAACAGAATGACGAATACAATGAACACATCGAATACAAATACAATCCTGATAGAAGCCGCGAGAACTCCAACTCCCAGGACGGCCCCATCAAGGATGTCGAGGACAAAATAAGACTAGGTCGTAATTTAAAGCCAAAGAATAGTAAAAAATTACAACCCCAAATGTCTAAGTTTAGAGCACGCACGCTATTCAATCAGCTCTCTGGCCTCTCTAATTTAAATCCCGCACTGAATAGCTTCGATAAATTTCCGCCAGAGACAGTTCTCATGCCGGCTCTCGCCACACAGCTGTTCGCAGCCGAATTGGAACAGAACAACTTGAACATGGTCAATAGCGAAGTGTCCGATTTGGGGCAGACCAACTGGGAACAGCGCATATTCCCATCGCCCATCAGAAAAAATAACATGGCAAACGTCGGCAACTATCACGAGGAGACGAACGAATCCGTCAGGGATTACTGCATCAAGGAGGGCGAGAACGTTTTCAGGTGCAAGATATGCGCCAGGGTCTACACCCACATCAGCAATTTCTGCAGACATTACGTCACCTCACACAAGAAAGACGTCAAAGTCTTCCCTTGTCCGATATGTTTCAAGGAGTTCACCCGCAAGGACAACATGATCGCCCATCTTAAGATCATACACAAAAACCAGGCCAACGCCAACGAGCAAACCGCGAAGCAAGAGTCTTAA

Protein sequence:

>DPOGS204256-PA
MATQRFCLRWNNHQTNMLSVFDQLLHAETFTDVTLAVEGQLLKAHKMVLSACSPYFQALFVNHQEKHPIVILKDVPYSDMKSLLDFMYRGEVSVDQERLTAFLKVAESLRIKGLTEVNEEKCDIPALTNSLIQQQQGNTAHTPPPQLHRIHPYMHQKRPASGIPSGGAPPNLLMPLLGNALMQPKRKRGRPRKLSGSSSDALNTAASPPGEFIPESASHASSNRAGDQLIRGSPEMLEVKMSMDSFNAEDGVTSGGEDAGEALMIDEGDDAQSNEAPQSGKDSESADNDKQPKEETHQNFPSNGPILSIENGAIKQEPACEQNDEYNEHIEYKYNPDRSRENSNSQDGPIKDVEDKIRLGRNLKPKNSKKLQPQMSKFRARTLFNQLSGLSNLNPALNSFDKFPPETVLMPALATQLFAAELEQNNLNMVNSEVSDLGQTNWEQRIFPSPIRKNNMANVGNYHEETNESVRDYCIKEGENVFRCKICARVYTHISNFCRHYVTSHKKDVKVFPCPICFKEFTRKDNMIAHLKIIHKNQANANEQTAKQES-
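
The two lengths reported for this gene are consistent with each other: a 1653 bp transcript is 551 codons, which gives 550 residues plus the stop codon shown as the trailing "-" in the protein sequence. The Python sketch below illustrates that check. It is not part of the database record; it assumes the transcript is coding sequence only (it starts at ATG and ends at a stop codon) and that the standard genetic code applies. The names `translate` and `residues_excluding_stop` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
# Assumption: the standard code applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`, matching the trailing "-"
    used in the protein sequence of this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def residues_excluding_stop(cds_length):
    """Number of amino acids encoded by a CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return cds_length // 3 - 1


# First ten codons of DPOGS204256-TA followed by its stop codon (TAA).
example = translate("ATGGCTACTCAAAGATTTTGTTTGCGTTGGTAA")
```

Passing the full DPOGS204256-TA sequence to `translate` and comparing the result with DPOGS204256-PA would extend the same check to the whole record.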