Monarch geneset OGS2.0

DPOGS203320
TranscriptDPOGS203320-TA1134 bp
ProteinDPOGS203320-PA377 aa
Genomic positionDPSCF300003 - 851864-855822
RNAseq coverage211x (Rank: top 46%)
Annotation
HeliconiusHMEL0088143e-10778.52% 
BombyxBGIBMGA002080-TA1e-15674.50% 
Drosophilattk-PA1e-3353.72% 
EBI UniRef50UniRef50_D6WVX11e-4461.65%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WVX1_TRICA
NCBI RefSeqXP_971758.17e-4561.65%PREDICTED: similar to Mod(mdg4)-heS00531 [Tribolium castaneum]
NCBI nr blastpgi|2700117765e-4461.65%hypothetical protein TcasGA2_TC005851 [Tribolium castaneum]
NCBI nr blastxgi|2700117762e-6243.08%hypothetical protein TcasGA2_TC005851 [Tribolium castaneum]
Group
Gene OntologyGO:00055152.7e-24protein binding
KEGG pathway 
InterPro domain[4-115] IPR0113334.7e-29BTB/POZ fold
[22-116] IPR0130692.7e-24BTB/POZ
[32-127] IPR0002105.1e-21BTB/POZ-like
Orthology groupMCL20689 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203320-TA
ATGGCGATCCCTGAACAGTTTTCGTTACGGTGGAACGATTTTCACTCAAACTTGTCTCAATCTTTCCAAGCTCTTTTGGAAGGCGAAGATCTGGTGGATGTAACATTAGCGGCGGGAGGGCAGTATGTTCAGGCTCACAAGCTCATACTTTCGGTCTGCAGCCCATACTTCAAGGAGCTGTTTAAGATGAACCCGTGTGAACATCCGATAGTTATACTGAAGGATGTAGCTCACCAGGAACTGAAACAGCTTCTGCAATTTATGTACCGCGGAGAAGTACATGTGAGACAGCAAGAGCTCTCTGGATTTTTACACACAGCCGAGTTACTTCAAGTTAAAGGGCTCACGGGTGGTAGAGAGAGAAGTGAATCTCCCCAGCCAGTGGTTGAAAGTGAGTCAACTAACACTGGTCGTCAGCAAGTACCTGAATCTGGTGGTGATAGTCTACCTGAATGGGTCCCACCTTCCGAGGAGGCGACTGTATCAGATGTTTCACCAGAATCAGCCAGCTCGGGGCCTCCCACCGAAGAAGCAACTCGAAGCCCACTGAAGAGATTATTGAAGAACACACCAGGCAAGACAACCTATAATATGAAGAAGAAGCCTCGACCTGTCAACGATAGCCCCACTCATGCTGAGAACACGGAATATGCATCTGACAGTGAGCCCATGATCGACTTCGACAGCGACATGTTCAATAATGTCCTATTGCCAGATTCAGCCAAAGACTCGGGATGGAACTGCAAAACCGGTGGTGTTAAGTGTCCGTCCTGCCATCGGTTTTTCGCGAATCGGTACAATCTTAAAGTGCATATACGTGACAAGCACGACACTAGAGAGGGTACGCTACAATGTGATATATGTCAAAAACGTATGCGTAATCCATCATGCCTCCGGGTTCATATGTACCATCATCGGAAGCAGGCTACATATTTGGCACAACTCAGCGCCCAAGGCGATCAAATGAGACACGCCGTTCAGAACATGGTTGGCAATAAATGGCGATCTGAAGCAAACGCGGAGTTCAGGGATGCTGACAATTTCCCAGCTGTCACTGAGAATGCCAAGCTGCAGGCGGAAAGGCCAAGCGAGGCTGCTCTGCCTAAGTCAGAGACTAATCAAGAGGCCGTATGA

Protein sequence:

>DPOGS203320-PA
MAIPEQFSLRWNDFHSNLSQSFQALLEGEDLVDVTLAAGGQYVQAHKLILSVCSPYFKELFKMNPCEHPIVILKDVAHQELKQLLQFMYRGEVHVRQQELSGFLHTAELLQVKGLTGGRERSESPQPVVESESTNTGRQQVPESGGDSLPEWVPPSEEATVSDVSPESASSGPPTEEATRSPLKRLLKNTPGKTTYNMKKKPRPVNDSPTHAENTEYASDSEPMIDFDSDMFNNVLLPDSAKDSGWNCKTGGVKCPSCHRFFANRYNLKVHIRDKHDTREGTLQCDICQKRMRNPSCLRVHMYHHRKQATYLAQLSAQGDQMRHAVQNMVGNKWRSEANAEFRDADNFPAVTENAKLQAERPSEAALPKSETNQEAV-