Monarch geneset OGS2.0

DPOGS209522
TranscriptDPOGS209522-TA2364 bp
ProteinDPOGS209522-PA787 aa
Genomic positionDPSCF300127 + 461356-482057
RNAseq coverage825x (Rank: top 16%)
Annotation
HeliconiusHMEL0162652e-16266.18% 
BombyxBGIBMGA007329-TA3e-10487.66% 
Drosophilabab2-PA1e-3250.00% 
EBI UniRef50UniRef50_UPI00017925232e-3653.72%UPI0001792523 related cluster n=1 Tax=unknown RepID=UPI0001792523
NCBI RefSeqXP_001950638.14e-3753.72%PREDICTED: similar to bric-a-brac [Acyrthosiphon pisum]
NCBI nr blastpgi|1936411108e-3653.72%PREDICTED: zinc finger protein 131-like [Acyrthosiphon pisum]
NCBI nr blastxgi|1571676091e-3424.42%hypothetical protein AaeL_AAEL002435 [Aedes aegypti]
Group
Gene OntologyGO:00055156.1e-21protein binding
GO:00036762.6e-11nucleic acid binding
KEGG pathway 
InterPro domain[7-130] IPR0113333.7e-27BTB/POZ fold
[30-122] IPR0130696.1e-21BTB/POZ
[35-130] IPR0002102.3e-17BTB/POZ-like
[484-514] IPR0130872.6e-11Zinc finger, C2H2-type/integrase, DNA-binding
[593-603] IPR0204783.7e-08AT hook-like
Orthology groupMCL25336 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209522-TA
ATGCAGGGGCTCGTGAGCGACAAGACGTTCCACTTGAAGTGGAACAACCACCTGCAGAACCTGAGCCAGCTGTTCACGACCATCTACTCGTCGTCGGCGCTCGCGGACGTGACGCTCTCCTGCCGTGATGGGACCCTCAAGGCGCACAAGCTCGTATTGTCGGCTTGCAGCCCTTATTTCGAACAGATATTCAAGGACAATCCATGCCAGCATCCGATTGTGATCTTAAAGGGGATTCCGTTCTCCGAGATCAATCTCTTGGTAGAGTTCATGTACAAAGGGTCGGTGGACGTCCAAGAGTTGGATCTGCAGTCGTTGATGCACACAGCTTCCGAGTTGGAGATCAGAGGACTTGCATATGAGGCTCGTGACAATGCAGCACAGTTGTTAAACGTTAACTTGGAGTATCCAACATACACTCAGAATGCTACCACAACAGCAACGACCACAGCGACCGCAGTCTCAACATCACAGACATACCCACAGACCAGAACAGACGTGGAGAGGTTGAAACAGATCCACATGTACCAGCAGTCCATGATGCGTGCTGAGGAGATACGGCGACGGCGGCGAGAGGACATGCCGGGGACCCAGACAGCCGTCAACAACATATTAGCAGCGGCTGCCAGGGAGATGGAGGAGGCGAGGCATGCGGCCCGCGGGGAGAGGGTCGTGGCCAGCATACAGACGGAACAGGAAGTAAGTGGAGAGTTCAACAATTCTTTAAGAACCGTTACACAGCTCGCGTCTGATGAAAACATCCACATGTACCAGCAGTCCATGATGCGTGCTGAGGAGATACGGCGACGGCGGCGAGAGGACATGCCGGGGACCCAGACAGCCGTCAACAACATATTAGCAGCGGCTGCCAGGGAGATGGAGGAGGCGAGGCATGCGGCCCGCGGGGAGAGGGTCGTGGCCAGCATACAGACGGAACAGGAAGCAACATCCACATCATCAGCGTCGGTAGCGGCGACGTCCACACAACACGACGTGAAACGAGAGTACGAGGACGAGAGACCGAAGAAACGGGTCTCCATCATGTCACCAGACGACGAGAAGACGAGGGTCAAGAAAAAGACCATGACGGTACGCTTCCAAGAAGACAAGACTGACGACGGAACACAGAAGATTACATCACCAGGCCCCAACACAAACCAGCCGGTCCACATTCAGACTAACCACGGCGAATCAAAAAACGACAGCGAAATGAAGTCGGGTGGCATCAAAGTGGTTCAGCTGGCCGTGCTGCAGAAACAGAACCAGACCAACAACGAGCACGACGACTCTGACGACAAGGCGGGCAAGCTAGTGATGGACGACGATGAGGAGACGGATTCCAGCGGGTCCATATCAGACTCGGCGGCGAACGAGTCGCAGGACGGGCGGCCGCACGTCTGTGATATATGTGACGTCAGGTTCGCTCGGTCGTCTCATCTCTCCCGCCATCGCCTCACACACACCGGCGAGAGACCCTTCACATGCGGAGGATGCGGGCGCTCCTTCGCGCGCTCCGACAAACTGAGGGTGCACACTAAAATATGCGAACGGGAGGATCCGACAGTGGACATGGCCAATGAATCAGTGAAACGTGAGAACAACTCGGAAGTGATGTCAGGGATGGTGCGCACCACTCACCGGGAGTCAGGACACCTGGTGTTCACGCCCAGTGGGGACGTCATGCTGCCTCCGCGGCCCGGCAACCCTGTAGTACTCAATTCGACCCTGGAACCCGTGAGGAACGAACTCAATGCCTCGGACTTAGTGGACCCGCCGCGCAGGGGCCGCGGCAGGCCGAGGAAGACCCCCCTGCCCCTCACGCCCAAGATCAAGAAGAGGAGGGGACGACCGCCCAAGACATCACTCGATATGGAAGGCTGTGTTCTGACAGGCGGGTCGGTCTCCGGCAGCGCGCTGGCCGGACATCAGCTCATGCTGAAAAGGAAGCGAGGACGCCCGCCAAAGAACTACTTCCTGTCACAGCCGGACGGACGACCCAACGAGAACTACAACGCGACCAACCTGCCGTTCGGGGACTTCTCCTACCTGACCGAGATGATGTACAACCCGCTGGCGTACCCCTACGTCACCGTGGACCCCGACCGGAACGTGGACGCCGACGCCGCCGACACGTCGCGCGGCACCATCGACGTGTCCGACACCAGCTCCGACGACGACTCCGAGGCCGACCGCCGCCGCGTCATGACGGTGGGCGACTGCCAGATAGTGAAGCTGCCGCCGGCCGACGAGCGCGAGGAGCGCGACGACTCCCCGCCCGACGAAATCCCGCCCCCCACCAGCTCGGTCACCATCACGCCCATCACCACCGTCGGCGACTGCACGCTCCGGCCTCACAACGCATAA

Protein sequence:

>DPOGS209522-PA
MQGLVSDKTFHLKWNNHLQNLSQLFTTIYSSSALADVTLSCRDGTLKAHKLVLSACSPYFEQIFKDNPCQHPIVILKGIPFSEINLLVEFMYKGSVDVQELDLQSLMHTASELEIRGLAYEARDNAAQLLNVNLEYPTYTQNATTTATTTATAVSTSQTYPQTRTDVERLKQIHMYQQSMMRAEEIRRRRREDMPGTQTAVNNILAAAAREMEEARHAARGERVVASIQTEQEVSGEFNNSLRTVTQLASDENIHMYQQSMMRAEEIRRRRREDMPGTQTAVNNILAAAAREMEEARHAARGERVVASIQTEQEATSTSSASVAATSTQHDVKREYEDERPKKRVSIMSPDDEKTRVKKKTMTVRFQEDKTDDGTQKITSPGPNTNQPVHIQTNHGESKNDSEMKSGGIKVVQLAVLQKQNQTNNEHDDSDDKAGKLVMDDDEETDSSGSISDSAANESQDGRPHVCDICDVRFARSSHLSRHRLTHTGERPFTCGGCGRSFARSDKLRVHTKICEREDPTVDMANESVKRENNSEVMSGMVRTTHRESGHLVFTPSGDVMLPPRPGNPVVLNSTLEPVRNELNASDLVDPPRRGRGRPRKTPLPLTPKIKKRRGRPPKTSLDMEGCVLTGGSVSGSALAGHQLMLKRKRGRPPKNYFLSQPDGRPNENYNATNLPFGDFSYLTEMMYNPLAYPYVTVDPDRNVDADAADTSRGTIDVSDTSSDDDSEADRRRVMTVGDCQIVKLPPADEREERDDSPPDEIPPPTSSVTITPITTVGDCTLRPHNA-