Monarch geneset OGS2.0

DPOGS207933
Transcript: DPOGS207933-TA (1929 bp)
Protein: DPOGS207933-PA (642 aa)
Genomic position: DPSCF300090 - 507252-511945
RNAseq coverage: 8x (Rank: top 86%)
Annotation
  Heliconius      HMEL007071              9e-65   37.08%
  Bombyx          BGIBMGA000376-TA        2e-111  75.40%
  Drosophila      CG15269-PA              2e-95   78.00%
  EBI UniRef50    UniRef50_UPI00021A72C8  9e-94   76.73%  UPI00021A72C8 related cluster n=3 Tax=unknown RepID=UPI00021A72C8
  NCBI RefSeq     XP_001121357.1          4e-95   52.16%  PREDICTED: similar to CG15269-PA [Apis mellifera]
  NCBI nr blastp  gi|340718151            3e-93   76.73%  PREDICTED: hypothetical protein LOC100644526 [Bombus terrestris]
  NCBI nr blastx  gi|195115573            4e-97   68.00%  GI13302 [Drosophila mojavensis]
Group: (none listed)
Gene Ontology
  GO:0005515  2.2e-17  protein binding
  GO:0003676  5e-14    nucleic acid binding
  GO:0008270  2.9e-06  zinc ion binding
  GO:0005622  2.9e-06  intracellular
KEGG pathway: (none listed)
InterPro domain
  [1-107]    IPR011333  1.1e-18  BTB/POZ fold
  [5-106]    IPR013069  2.2e-17  BTB/POZ
  [11-108]   IPR000210  7.1e-17  BTB/POZ-like
  [551-578]  IPR013087  5e-14    Zinc finger, C2H2-type/integrase, DNA-binding
  [531-553]  IPR007087  2.9e-06  Zinc finger, C2H2
Orthology group: MCL17473 (Insect specific)
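
The InterPro hits listed above overlap heavily, so it can help to collapse them into distinct regions and to pull the matching residues out of the protein. The following is a minimal sketch, not part of the original record. It assumes the bracketed ranges are 1-based, inclusive residue positions (the record does not state this), and the names `DomainHit`, `domain_sequence`, and `merged_regions` are hypothetical helpers introduced for illustration.

```python
from typing import List, NamedTuple, Tuple


class DomainHit(NamedTuple):
    start: int        # assumed 1-based, inclusive
    end: int          # assumed 1-based, inclusive
    interpro_id: str
    evalue: float
    name: str


# Values copied from the InterPro domain rows of this record.
HITS = [
    DomainHit(1, 107, "IPR011333", 1.1e-18, "BTB/POZ fold"),
    DomainHit(5, 106, "IPR013069", 2.2e-17, "BTB/POZ"),
    DomainHit(11, 108, "IPR000210", 7.1e-17, "BTB/POZ-like"),
    DomainHit(551, 578, "IPR013087", 5e-14,
              "Zinc finger, C2H2-type/integrase, DNA-binding"),
    DomainHit(531, 553, "IPR007087", 2.9e-06, "Zinc finger, C2H2"),
]


def domain_sequence(protein: str, hit: DomainHit) -> str:
    """Return the residues covered by a hit (1-based inclusive coordinates)."""
    if not (1 <= hit.start <= hit.end <= len(protein)):
        raise ValueError(f"{hit.interpro_id} lies outside the protein")
    return protein[hit.start - 1:hit.end]


def merged_regions(hits: List[DomainHit]) -> List[Tuple[int, int]]:
    """Merge overlapping or directly adjacent hit ranges into regions."""
    regions: List[List[int]] = []
    for start, end in sorted((h.start, h.end) for h in hits):
        if regions and start <= regions[-1][1] + 1:
            regions[-1][1] = max(regions[-1][1], end)
        else:
            regions.append([start, end])
    return [(s, e) for s, e in regions]


print(merged_regions(HITS))  # [(1, 108), (531, 578)]
```

Under these assumptions the five hits reduce to two regions: an N-terminal BTB/POZ region and a C-terminal zinc-finger region.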
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207933-TA
ATGTGTGAATGGCTCTCGTGTGAAGAATGGTGCGATGTATTTTTATTGTGCGGTGGTCGGGTATTCAGTGCTCACCGAGCAGTTCTGGCAAGCGTCAGCAAGTTTCTCAGGAAAATCCTTCTTTCATGTTCAGTAGAAGATTCACCGACGTTTATTGTTATGCCCGAATTTGATTTTGATACCATGTCTTCCGTTCTTCATTACATCTATAACGGTGAAGTAGCCATCAGCAAACACCAATTACACACATTTTTGGATATCATGAACGCTTTACAAGTTTTCGTTGATACTAGAGATTTAATGAAGCATTTAAAAAATAATAATTATGTTTTTGTTGAGGATGATTATACTTTCAAAAAAGAAAATATAACCAGCAAAGATAATGTCACCTTAAGGAAAGAAATTCAGAAAAAATCTAATGTAAGGCGTCGGTTTGATAAAGTAAATTTAGACCGAAGTAATATAGATTCAAATTTTTATAAGACCAAAGAAATCCGGCTCGGATATCTGAAATCTCTCGAAGACCGACAAAGCCAGCCATTTTCTCGTGATTTATTTATTGAGACATATAGTCTGAACCGTAATGGTAACGGTGTTACAGAGGGCGTGCTCGCAATATCAAGTGAACGCTATCGTGAAAAAGGATCGCTTCTCGAGAAGAATTCTAATACTGAGCATCACAAGGAAGTGGCGACAGAAACAACTTTTGACATAGGTTCTAGGGATAATTTTAACTTTTTATGTAACACAAAACATTTTGAAATTACAAGTAAGGAAAATATTGAAAGTTTTGTTGGTATGTACTTAGGGCACAAAACTAATGTGGAAGTTGAGAAGATTCCTGAAAATAATTACCTCAGTGACATTCAAGGCGAAGAGATATCAAGAAAAACTGTCGATCAAGAACATACTTCTATAAATTGTAGTATTATTTCTAGGCATAGTTTAAATTTAGATACAAAAAATATGCAAAATGCGGTATCTAGGGTGTCGGAAGTAAATAGTGAATTTAATAAAGGCAAAGAAGCTGTTAGTAAAGTGACCATACTTAATGAGGTATTACAGAGCCCTTGGAGACCTCGGCTGTCGCTTACTTTTACTCCTTATGCTAAGAAATGTTTGGACTGGAAGCTAGATAAAGACAAAACCATACAGTCAAAACAAAACGGCGTTTCGATTAATGATAAAAACAACAATAATAATGAATCTACCAAGATTGAGAACAATAATAATGAGGCCACGAGCAATATAAAAGAAATGAAACCGAGTGTTTCAAAAGACGTAACAGCCGACAACAAAGACGAACATTTGACTGGATCTAAATCAACTCGCTACACCTGTACTGAATGTCATAAAACATTTTCACAATTACGTAATTATAAATATCATATGTCAGTGCATCGTGGTACAAAGGAGTTCGCCACAACGTGTCCAGTTTGCGGGAAATTTTTCAATGATAAAGGATATCTCAGTAGCCATATGAAAATACATAAAAACCGTAAGGAATATAAGTGTAACATGTGTCCAAAATCGTTCAATCAACGAGTTGCTTACAACATGCACGTTCGTATACATACAGGCGTCAAACCTCACGTATGCGATGAATGTGGTAAAGCGTTCTCACGCAAGATGTTACTGAAGCAGCATCAACGGACCCATAGCGGGGAGCGACCGTACGCTTGTCAGCATTGTAACAAGAGATTTGCTGATAGATCTAATATGACCCTGCACTTAAGATTACATACAGGAGTAAAGCCGTTCGCTTGTACACTATGCCCGAAATCGTTTACCAAGAAGCATCACCTAAAGTCTCACCTCAACTTCCACACCGGCGACAAACCATATACTTGCCCACGCTGCAAACTGGCTTTCACACAGTCATCCAACATGAGAACGCATATGAAGAAATGTGACGTTCATAAAGATTAA

Protein sequence:

>DPOGS207933-PA
MCEWLSCEEWCDVFLLCGGRVFSAHRAVLASVSKFLRKILLSCSVEDSPTFIVMPEFDFDTMSSVLHYIYNGEVAISKHQLHTFLDIMNALQVFVDTRDLMKHLKNNNYVFVEDDYTFKKENITSKDNVTLRKEIQKKSNVRRRFDKVNLDRSNIDSNFYKTKEIRLGYLKSLEDRQSQPFSRDLFIETYSLNRNGNGVTEGVLAISSERYREKGSLLEKNSNTEHHKEVATETTFDIGSRDNFNFLCNTKHFEITSKENIESFVGMYLGHKTNVEVEKIPENNYLSDIQGEEISRKTVDQEHTSINCSIISRHSLNLDTKNMQNAVSRVSEVNSEFNKGKEAVSKVTILNEVLQSPWRPRLSLTFTPYAKKCLDWKLDKDKTIQSKQNGVSINDKNNNNNESTKIENNNNEATSNIKEMKPSVSKDVTADNKDEHLTGSKSTRYTCTECHKTFSQLRNYKYHMSVHRGTKEFATTCPVCGKFFNDKGYLSSHMKIHKNRKEYKCNMCPKSFNQRVAYNMHVRIHTGVKPHVCDECGKAFSRKMLLKQHQRTHSGERPYACQHCNKRFADRSNMTLHLRLHTGVKPFACTLCPKSFTKKHHLKSHLNFHTGDKPYTCPRCKLAFTQSSNMRTHMKKCDVHKD-
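
The two sequences above can be cross-checked by translating the transcript. The sketch below is illustrative and not part of the original record. It assumes the standard genetic code and that the transcript is a coding sequence starting in frame at its first base. The trailing `-` in the protein is taken to mark the stop codon, and `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, codon by codon.

    Stop codons are written as `stop_symbol`; codons containing
    characters other than T, C, A, G are written as 'X'.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First and last 30 nucleotides of DPOGS207933-TA, copied from the record.
head = "ATGTGTGAATGGCTCTCGTGTGAAGAATGG"
tail = "ATGAAGAAATGTGACGTTCATAAAGATTAA"
print(translate(head))  # MCEWLSCEEW
print(translate(tail))  # MKKCDVHKD-
```

The lengths agree with this reading: 1929 bp = 3 x (642 + 1), that is, 642 amino-acid codons plus one stop codon.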