Monarch geneset OGS2.0

DPOGS206049
TranscriptDPOGS206049-TA2460 bp
ProteinDPOGS206049-PA819 aa
Genomic positionDPSCF300028 - 1085076-1095972
RNAseq coverage1610x (Rank: top 8%)
Annotation
HeliconiusHMEL0028200.074.97% 
BombyxBGIBMGA000503-TA8e-14085.38% 
DrosophilaCG10984-PB6e-6735.88% 
EBI UniRef50UniRef50_D7EJI04e-10340.79%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7EJI0_TRICA
NCBI RefSeqXP_969950.22e-14340.30%PREDICTED: similar to CG10984 CG10984-PC [Tribolium castaneum]
NCBI nr blastpgi|1892424264e-14240.30%PREDICTED: similar to CG10984 CG10984-PC [Tribolium castaneum]
NCBI nr blastxgi|1892424261e-14839.86%PREDICTED: similar to CG10984 CG10984-PC [Tribolium castaneum]
Group
Gene OntologyGO:00055152.1e-05protein binding
KEGG pathway 
InterPro domain[64-189] IPR0206834.2e-38Ankyrin repeat-containing domain
Orthology groupMCL16094 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206049-TA
ATGCCGTCGACGTCTCGGTCGCGGGGCTCTGGGCGCGGGGCTGATGGCGCGCCGAAGCCCCAAGTCATAGCCCCCATGTCGGAGCGGCAGCAGTTGGCACTGGTGATGCAAATATCATCCCAGGATGCCCCTCCAGCCGCTCCGACTCCGGCCGAGGAACGGAAGCATACTCGACAGCAACGAAATGAACGCGGAGAAACACCATTACATGTAGCAGCTATAAGAGGAGACCATGAGCAAGTCAAGAAGCTTCTTGATCAGGGTCAAAACCCAGACATTCCAGATTTTGCCGGCTGGACAGCGCTCCACGAAGCCTGTAGCTACGGTTGGTTTGAAGTGGTGACTGTGCTGGTAGAGGGCGGTGCTAATGTGAACGCCAAGGGCCTAGATGACGACACACCACTCCATGATGCCACCACCTCCGGGAATCTTAAGATGGTTGAACTGCTCATAGAACGTGGGGCTGATCCGTTCGCAAAGAACGCAAAAGGAAAAACACCCTCAGATTATGCTACACCACACATCTTTGAATACCTCCAGTCTTTGAAAGATAACGCGAGAACTGCAAATACAAGTCGTGCAAGGGATGACAGCAACAAGAAGGGCAATGCAAGTGCAAACAGCGAATCACACAAGCGAAGTAACATGGAGGGCCTAGCGCAGGGCGAACATTTGGAGGGCAAGGAAACCTCCACTCAAGAGAAAGGCAATGTTGAAAGTAGAGATGATGGAACATCACAGCTGAGTAGTGGTGTTGTATCCTCTACTCAGGATGAAGTCAATCCTGGCAACAAGAGGTTGTATACAGAAGACGGTCAAGAGGCAGATGTGACAGAGGAAGATGTTTCAAAACGGAAGAAGAGAAAAGACAATGAAGAAAAGGAACCCATGGCCAAACCAGCACCTGTGGCTCGAGGTGGTGCCGGCCGAATCCTAACTGGTAGCAAACCTCCGGGGCCGGCAAGTAAGACGGGCGCACAGCCGTTGGGCAAAGGAGCACAGAACACAAAGGGCGCGGGTTCTTCTTCTGGAAAAGGTGCGCAGGCGCAGGGGAAGGGGGGCGGCGCCAATGCCAAAGCTAACGCGCAGGGATCTCATCAGGGTAAATCAGGAGGTGCAAAGGGAGCATCAGGGTCACAAGCTGGCAAACAAGATCGGAAGAGTCCAGTCGCAAGCCCGAAGCCAAATCAGGGTAAGGATGACGACGAGGACTCCAAAAGCCAGGAATCGTCCGCACCGAAAGTGCCACCCCTGAAGATAGTAATACCAGGAGGGGCGGGTGGTTCCGGAAGTCGTAATGAACAGGAAGGCGATGGCAGTACGGGTCAGCGTGGTGGAGGCAAGGGTCGTGGTAGCCTGTCAACGCTGCCATATGTCATCCCATGCACCAGCGCTGACACTGGACAGACATCAGACAGCTCTGACAACTGTGAAGACAAGCGAACTGGGGAAGCCAAGGCCGGGCAAAGGGTTCTCCGCTCCCACAGAACGAATGACGGTGATAAAGATAAAGGGATGACCTCTCCGCTAAGAGGGTCGGAGAATCGCTCAGGCTCAGGGCAGAATCAGGCTTTGAATTCCAACAAAAGTCCACCACCCTCTGGGCAGGACGCTGACCACGCTACATCTACTTCGTCGGGTCCTGGTGGAAGATCAGAGTCAGCTGCTGGTCCGTCTGTGGAGTTGCATCCTCGTAAGCGGAAAATAAAAGCATCGAAAGATAATCATTCAAGAGACAGCAAGAACGAGCAAGTCCCAGACAGCACCACGCTTACTCACAACGTCACACATTCCAACCCCTATCAAATGTATATTCATATAAGGAAACAGATCGAGAGACGTCAGAAGGCTCTGTTTCCAGTGAAACCAAAGCCGCCTAAAGACTTCAACAAGTACCTGATGAACCGTTGCACTTACACTCTACAGAGTACTGTTAACCCAGAGCCTCAAGTTGAAATACCACCCAACCTACCATCACAAATGGTGAACGAGTTTTTGGCTCAAGAAAAGGAAAGGACAAGATTACGTATCCAACACCTTGTTGAGAAAGAGAAGCTTGTTTTGGCCGTCGAGCAGGAAATATTGAGAGTACACGGGCGCGCCGAACGCGCTGTGGCTAACCAGGCTCTCCCCTTTTCCGTGTGCACTATACTGCGTGATAAAGAGGTTTACAACGTGCTAGCACCGGAACAAGAGGAGAAACGCAACGCGCAGCGATCCCGTTGTAACGGACGTCAAATCAACTCGTGGCTCCAGGAAGTCGATGATAAATGGGAGAAGATCAAGGAAGGTATGCTTCGCCGTCAGCATACGGAAGCTGAGACCTTACACGCAGTACAGATCATGGGATGGGAATGGAAATTAAAAGAACACGGTCTGTGCGACTACAAGTCCACGCCGAAGATAGATCCGACACACGTACCACAGATACACGTGTCGAATTTTGACTTGCCCGCTTGA

Protein sequence:

>DPOGS206049-PA
MPSTSRSRGSGRGADGAPKPQVIAPMSERQQLALVMQISSQDAPPAAPTPAEERKHTRQQRNERGETPLHVAAIRGDHEQVKKLLDQGQNPDIPDFAGWTALHEACSYGWFEVVTVLVEGGANVNAKGLDDDTPLHDATTSGNLKMVELLIERGADPFAKNAKGKTPSDYATPHIFEYLQSLKDNARTANTSRARDDSNKKGNASANSESHKRSNMEGLAQGEHLEGKETSTQEKGNVESRDDGTSQLSSGVVSSTQDEVNPGNKRLYTEDGQEADVTEEDVSKRKKRKDNEEKEPMAKPAPVARGGAGRILTGSKPPGPASKTGAQPLGKGAQNTKGAGSSSGKGAQAQGKGGGANAKANAQGSHQGKSGGAKGASGSQAGKQDRKSPVASPKPNQGKDDDEDSKSQESSAPKVPPLKIVIPGGAGGSGSRNEQEGDGSTGQRGGGKGRGSLSTLPYVIPCTSADTGQTSDSSDNCEDKRTGEAKAGQRVLRSHRTNDGDKDKGMTSPLRGSENRSGSGQNQALNSNKSPPPSGQDADHATSTSSGPGGRSESAAGPSVELHPRKRKIKASKDNHSRDSKNEQVPDSTTLTHNVTHSNPYQMYIHIRKQIERRQKALFPVKPKPPKDFNKYLMNRCTYTLQSTVNPEPQVEIPPNLPSQMVNEFLAQEKERTRLRIQHLVEKEKLVLAVEQEILRVHGRAERAVANQALPFSVCTILRDKEVYNVLAPEQEEKRNAQRSRCNGRQINSWLQEVDDKWEKIKEGMLRRQHTEAETLHAVQIMGWEWKLKEHGLCDYKSTPKIDPTHVPQIHVSNFDLPA-