Monarch geneset OGS2.0

DPOGS215980
TranscriptDPOGS215980-TA1542 bp
ProteinDPOGS215980-PA513 aa
Genomic positionDPSCF300078 - 444551-446423
RNAseq coverage201x (Rank: top 47%)
Annotation
HeliconiusHMEL0086790.093.76% 
BombyxBGIBMGA001210-TA0.087.98% 
Drosophilarib-PA3e-4565.55% 
EBI UniRef50UniRef50_UPI0002061A0D6e-6934.99%UPI0002061A0D related cluster n=1 Tax=unknown RepID=UPI0002061A0D
NCBI RefSeqXP_974222.24e-8138.28%PREDICTED: similar to ribbon [Tribolium castaneum]
NCBI nr blastpgi|2700051503e-8138.62%hypothetical protein TcasGA2_TC007162 [Tribolium castaneum]
NCBI nr blastxgi|1583007892e-11546.82%AGAP011902-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00055152.7e-20protein binding
GO:00036774.5e-11DNA binding
KEGG pathway 
InterPro domain[25-142] IPR0113332.3e-27BTB/POZ fold
[53-149] IPR0002102.7e-20BTB/POZ-like
[47-142] IPR0130691.4e-19BTB/POZ
[337-398] IPR0090579.2e-14Homeodomain-like
[345-388] IPR0078894.5e-11Helix-turn-helix, Psq
Orthology groupMCL15310 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215980-TA
ATGTTTCCTGTGTCAACTTACGAAGGCGTCCTGCCAGAATCGCAATTCAAAAAGATGGGGAGCAGTGAAGGCCAGCAAACTTTTTGTTTGAAATGGAATCACCATAAAACAAATCTGGTAGAAATATTGGAAGCTTTAATAAAAGTTGAAACATACGTCGACTGCACTTTAGTCGTCGATGATCAAGTTACGTTTAAAGCACACAGAGTTGTGCTCGCCGCTAACTCTCCATATTTTCAATCGATCCTAGCTGATGTGCCTATGGATCATTGTAGCATACTTTTCCCCGGAGTCAAGGATTTTGAAATGAGAGCCCTTCTCGAATACATGTACACGGGTGAAGTCAATGTCACACAAGCGCATATACCACGAATCATGAAAGTGGCTGAACAACTTGAAGTCAAAGGTTTGTTTGATATGACGGAGCTGAGACGCCGCCCTGGAAGCAGCGAACGTACCCCCGCTGCCTCCCCACCACGAGTAGTACCGGCTGCTCCCTCTAGTGTTTCTCCTCCTGCACCAAATAATCGCTGGCCACCACCGCCTACAGCTCCAGTACTTTCGGCTGCCTACGACTCTGCTGATATGAATCCATTAAAACGCAAAAAGTTATCAAGTATGCTAGCCACTCGTGATACCCCAATTTTAAGAAATGTTCTAGCACAGACAACTCCTGTAGATTCCTCCCAACCTATGTCTCTTGTCTGCCATCCTGTTAGTCAGCTTGAGTCAACACGTTTGCATTCAAACGGATCAGCTCATGAATTAGATCGGTCTGTAAGCCCCCAAAGACCTTTCGACTACAGGCCTCGTCGGTTGTCGTCTAGGGCGTCATCTCCTCATTATAATCGCTCAGATCGTTCAGAAGATGCTCATTCACCATACACAGAGCGATCTTTTGAGGAAGATAATCAACGTACTTTCCACCCTTCTCCCCCACCAGCTAATTTCCAACAAGACGTGCGAGCTGGGCTAGCGCCATATGTACCACCGCAACAAAAACCGGAATGGAAACGATATAAGCAATACACGCGATCAGACATTATGTCTGCTATAGAATGTGTGAGGAATGGCATGAGCGCTTTACAGGCATCGCGTAAATATGGCGTGCCCTCACGTACTCTATACGACAAGGTAAAAAAACTTGGTATTACAACAAGTCGTCCCATGAGCCGCGGAGTTAAAAGGGAATCGAATGGAGCTGCTTTCCCTTACGGTTTAAGCGGCACTGGTGGTAATGATGATGTAACACCTACTACTCCGCTCATCGACCCGTCCTTCCTACAACAAGCATTAGAAGGCGCTACAAGAGACGGGGGGCGCGAAGCTTTACACGCTATGGCCTTAGCTGCAGCAGCACATGCAGCATTGACTCCTCGAACCCCACCACGCTCAGCGCCACAATCACCAAGAACTCCACCACCTGACGATGACCATGTCGAGGACTTGTCAGTCGCGCGCAGACGCGATCCAGACCCTCCATCCGGCGTCATTGTCCCGCCACGTAATTTTGCTCTAGATTGCAATAGCGAAAGGGATTAA

Protein sequence:

>DPOGS215980-PA
MFPVSTYEGVLPESQFKKMGSSEGQQTFCLKWNHHKTNLVEILEALIKVETYVDCTLVVDDQVTFKAHRVVLAANSPYFQSILADVPMDHCSILFPGVKDFEMRALLEYMYTGEVNVTQAHIPRIMKVAEQLEVKGLFDMTELRRRPGSSERTPAASPPRVVPAAPSSVSPPAPNNRWPPPPTAPVLSAAYDSADMNPLKRKKLSSMLATRDTPILRNVLAQTTPVDSSQPMSLVCHPVSQLESTRLHSNGSAHELDRSVSPQRPFDYRPRRLSSRASSPHYNRSDRSEDAHSPYTERSFEEDNQRTFHPSPPPANFQQDVRAGLAPYVPPQQKPEWKRYKQYTRSDIMSAIECVRNGMSALQASRKYGVPSRTLYDKVKKLGITTSRPMSRGVKRESNGAAFPYGLSGTGGNDDVTPTTPLIDPSFLQQALEGATRDGGREALHAMALAAAAHAALTPRTPPRSAPQSPRTPPPDDDHVEDLSVARRRDPDPPSGVIVPPRNFALDCNSERD-