Monarch geneset OGS2.0

DPOGS214291
Transcript        DPOGS214291-TA   1221 bp
Protein           DPOGS214291-PA   406 aa
Genomic position  DPSCF300014 + 2270710-2272875
RNAseq coverage   154x (Rank: top 53%)
Annotation
Heliconius        HMEL011432               1e-140   76.09%
Bombyx            BGIBMGA012517-TA         2e-28    48.28%
Drosophila        ttk-PA                   2e-29    49.58%
EBI UniRef50      UniRef50_UPI00022CA209   3e-52    32.01%   UPI00022CA209 related cluster n=1 Tax=unknown RepID=UPI00022CA209
NCBI RefSeq       XP_393428.1              8e-30    35.98%   PREDICTED: similar to Broad-complex core-protein isoform 6 [Apis mellifera]
NCBI nr blastp    gi|350419416             9e-52    32.01%   PREDICTED: zinc finger and BTB domain-containing protein 37-like isoform 1 [Bombus impatiens]
NCBI nr blastx    gi|350419416             2e-51    31.31%   PREDICTED: zinc finger and BTB domain-containing protein 37-like isoform 1 [Bombus impatiens]
Group
Gene Ontology     GO:0005515           9.3e-23   protein binding
KEGG pathway
InterPro domain   [4-114]   IPR011333   1e-26     BTB/POZ fold
                  [22-113]  IPR013069   9.3e-23   BTB/POZ
                  [31-120]  IPR000210   1.9e-17   BTB/POZ-like
Orthology group   MCL35014   Lepidoptera specific

Nucleotide sequence:

>DPOGS214291-TA
ATGGCAAACCAAGAAATCAGTTTAAAGTGGAATGGCTATCAAAATAATATTCTAAGTAATGTAAAGGAATTATTTAAAGATGAAAATTTATCCGACGTCACACTTGTTTCGGAGGGACAAAGTTTTAAAGCTCATAAAATTATCTTGTCGGCAAATAGTTCTGTTTTTAGGACTATCTTTCAGCAAAACCCCCAGAAAGATCCAATTATAGTGCTCCATGACATCAACACGGATTCCTTGAAAACTTTGTTGAAGTTTATGTACAATGGAGAGGTTAATGTAACAGAGGAGTTTCTACCAGTTCTTCTCAAGACTGCTGAGAGTTTGAGAATATGTGGTTTGTCTGCTGGTAATGATGCTACTAGAGACGATGAGAAAAATGCAACATCAACACAGTCTTTGCCAAAGAAACGCAAGAAAAGTGAGTTGGATGATAGCAATAATAAAATTAAGAAGGCAGCTCCCTGCCCTCCCAAACCAGATTCAGTTGTTGCAGTCCCAACATGCGACCTGTTAAAAACCCCTCAAATAATGCCAAAAGTTGAACCTGTGGATTCACCATTAACTGATTATTCCGGAGAAAATACCACTGACACCGACATAGCTGTTCTTGAAGATACATTGGAAAAGAAATACTCTATAAGCCCAGAAAATCCTACAAGCAAATCTGTGCGTGCAAAGGGTATAAGCGATTCTAATGTGAAGAAATGGATCAACGAAGTCAGCAGCACACAGCCTTGTTTGAATGTTGAGAAAAATGTTGATCAAGAAGATGAAAAAGAGCTTGAAATTGATAAAGAAGTTGAAACTGTGTTGCAAAGTGGAACAATTGTAGAGTCATTGAGTATAACAACTGAGAATTTGAATTTTGTTTCCAGTGAGGCTCACAGTATGAAAGATAAAACGAGTACCTGTTATGCAAACCCGTCCTTTCCATGCCCGTTTTGTCCGCGTGTTTATAATTCTTGGGGTTACCGCAGACGACATGTCAAATCCAGACATATGACCAATAGATTATCCTGTAAGTGGTGTGTATCTATTTTGCCATCAACCGGAGCTTGGTATTCCCATGCTACAAGATCCCATGGGGTTCCTCATGAGGAAGCCAGGAATTCACTCGTTGTAATGGTCGAGGCACATGCTGTACTAACATTGAATGAACCTAGTGTGGCCCAGTTATTGGGCCAAGTTGGTATTGATAGTGGTGAAAAGGCAAACTAG

Protein sequence:

>DPOGS214291-PA
MANQEISLKWNGYQNNILSNVKELFKDENLSDVTLVSEGQSFKAHKIILSANSSVFRTIFQQNPQKDPIIVLHDINTDSLKTLLKFMYNGEVNVTEEFLPVLLKTAESLRICGLSAGNDATRDDEKNATSTQSLPKKRKKSELDDSNNKIKKAAPCPPKPDSVVAVPTCDLLKTPQIMPKVEPVDSPLTDYSGENTTDTDIAVLEDTLEKKYSISPENPTSKSVRAKGISDSNVKKWINEVSSTQPCLNVEKNVDQEDEKELEIDKEVETVLQSGTIVESLSITTENLNFVSSEAHSMKDKTSTCYANPSFPCPFCPRVYNSWGYRRRHVKSRHMTNRLSCKWCVSILPSTGAWYSHATRSHGVPHEEARNSLVVMVEAHAVLTLNEPSVAQLLGQVGIDSGEKAN-
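
The transcript and protein lengths listed above are consistent with each other: 1221 bp is 407 codons, that is, 406 amino acids plus one stop codon, which the protein record writes as a trailing "-". The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1). The function name `translate` and the `stop_symbol` parameter are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` so the output follows the
    convention of the protein record above (assumption: "-" marks the stop).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 30 bases and last 15 bases of DPOGS214291-TA, copied from the record.
print(translate("ATGGCAAACCAAGAAATCAGTTTAAAGTGG"))  # MANQEISLKW
print(translate("GAAAAGGCAAACTAG"))                 # EKAN-

# Length check: 1221 bp -> 407 codons -> 406 residues plus one stop.
print(1221 // 3 - 1)                                # 406
```

Passing the full DPOGS214291-TA sequence to `translate` in the same way allows the result to be compared against the DPOGS214291-PA record.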