Monarch geneset OGS2.0

DPOGS212369
Transcript: DPOGS212369-TA (1605 bp)
Protein: DPOGS212369-PA (534 aa)
Genomic position: DPSCF300019 + 293975-299045
RNAseq coverage: 989x (Rank: top 13%)
Annotation
Heliconius: HMEL005659, 2e-104, 81.78%
Bombyx: BGIBMGA004640-TA, 2e-57, 56.77%
Drosophila: lola-PI, 3e-60, 78.95%
EBI UniRef50: UniRef50_B4MIR3, 1e-58, 80.15%, GK10673 n=3 Tax=Drosophila RepID=B4MIR3_DROWI
NCBI RefSeq: XP_002017656.1, 3e-60, 80.92%, GL17189 [Drosophila persimilis]
NCBI nr blastp: gi|195153485, 5e-59, 80.92%, GL17189 [Drosophila persimilis]
NCBI nr blastx: gi|255522805, 2e-60, 42.93%, longitudinals lacking isoform 6 [Tribolium castaneum]
Group
Gene Ontology: GO:0005515, 5e-25, protein binding
KEGG pathway 
InterPro domain:
  [4-117] IPR011333, 5.9e-31, BTB/POZ fold
  [26-117] IPR013069, 5e-25, BTB/POZ
  [32-127] IPR000210, 1.3e-22, BTB/POZ-like
Orthology group: MCL25048, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212369-TA
ATGGATGATGACCAGCAGTTCTGTCTGCGGTGGAATAACCACCAATCGACGCTGGTTTCAGTGTTTGATACTTTGTTAGTGAAAGAAATTCACGTGGACTGCACATTAGCTGCCGAAGGGAGAACACTAAAGGCACACAAAGTGGTTTTGTCTGCGTGCAGTCCATATTTCGAGAGTGTATTATCCGAACAGTTTGACAAACATCCAATAATCATTTTAAAAGATGTCAAATTTGCTGAACTGAGAGCAATGATGGATTATATGTACCGCGGAGAGGTGAACATCTCTCAAGACCAACTGGCTGCCCTGCTGAAGGCAGCAGAGTCACTTCAAATCAAGGGTCTCTCAGATAACAAGCCATCCAGGCCACCGTCGCGCCCTCCCCCAGCACCCGCACACCCGCCTCGCGCTCCGTCTCCACCTCCACAAACGAGAGATGGCAGCTCGGACAGTCGCTCGGGGTCTCTGAGTCCATCGAGACGGCGTAAGAAAGCGCGGCGCACGTCCAGCCCCGTGCCACCCGCGGGCGGGCGCGGGCGGCGGTCGGGCGAGCCGCCCTGCAAGGAGGAGCTGGCCCGGGACGTGGAAGACCTCACGCTGGACGACGCTCACACCCCGCCCGCGGACCACAACGATGTCGTGCGCAATTTTCAATGGCATATGGAAAGATCTCAAGATGAAATTATGAATTCAAATGACAGCGTTCGAGAGAATCAAGAACCATCCACCATTCTAACAAGTAACTCGAGTTACAATAAAACCCCTTTTCCATTAACCCGTGGATATCTAAAACATGGTAGCGTAGAAGATCACGAGAGACTACCGGCCGAGAGGGGGGAGGCGAGGGTTAAGAGAGCTTATAGAAGGAGATTGCCCGGCAATAGGAAGTATAAGAAGTCTAGGGAAGACGGCCCGGCCGGAGGGAGGAAGGAGCACGCGGGTAACGAGAGACGGTACCTCTGTTCCATATGCAAGAAGAAACAGTACAAGTACAGGAGGAACAAGCTGCGCCACGAGAAATACGAGTGCGTCACGGGACCGCAGTTCGCCTGCCAAAAGACCGACCGAAAATCGGACTGTACGCTTTTCCTCATTTCAGGTCGAAGGACGACCCCCAGGCATAGTATTATGACGCGGAGTAAAAAAGTGAGCGAGGAGACGTCGTTCGCCACGACGCTCGCCATCCTCAAGAACGTGAGCGCCGCGGACGTGGCGGGCGACGACGTCAAGGAGAACGACCGCCGACCGCTTGGCGCCCAACCCTCGCCTGCCACAGACCCCGCGCCCAAAGCGGAGCCAGAGCCCGAGCCCGAGCCCAAGACTGAGCCGGAGGCGCGCTACCACGTGTTCCCTCGCGGACCGCAGCAGCTGCCGTGCGCTCAGTACAAGACCGACAAGGGCTACCGCTGTCCCAACTGCCAGCGCTGCTACAACGCCCGCAAGAACCTCGTGCGGCACGTGACGCTGGAGTGCGGGCGGGAGCCGCAGTACAAGTGTCCGCACTGCTCCTACAGCAAGCATCGACGCAACGAGCTCAAGAAGCACATAGAGAAGAAGCACCCCGAGCTGGCGCCGGCCGCGACGCCCGCCGCCCTGCGCGCCTGA

Protein sequence:

>DPOGS212369-PA
MDDDQQFCLRWNNHQSTLVSVFDTLLVKEIHVDCTLAAEGRTLKAHKVVLSACSPYFESVLSEQFDKHPIIILKDVKFAELRAMMDYMYRGEVNISQDQLAALLKAAESLQIKGLSDNKPSRPPSRPPPAPAHPPRAPSPPPQTRDGSSDSRSGSLSPSRRRKKARRTSSPVPPAGGRGRRSGEPPCKEELARDVEDLTLDDAHTPPADHNDVVRNFQWHMERSQDEIMNSNDSVRENQEPSTILTSNSSYNKTPFPLTRGYLKHGSVEDHERLPAERGEARVKRAYRRRLPGNRKYKKSREDGPAGGRKEHAGNERRYLCSICKKKQYKYRRNKLRHEKYECVTGPQFACQKTDRKSDCTLFLISGRRTTPRHSIMTRSKKVSEETSFATTLAILKNVSAADVAGDDVKENDRRPLGAQPSPATDPAPKAEPEPEPEPKTEPEARYHVFPRGPQQLPCAQYKTDKGYRCPNCQRCYNARKNLVRHVTLECGREPQYKCPHCSYSKHRRNELKKHIEKKHPELAPAATPAALRA-