Monarch geneset OGS2.0

DPOGS209704
TranscriptDPOGS209704-TA1863 bp
ProteinDPOGS209704-PA620 aa
Genomic positionDPSCF300309 + 149530-154258
RNAseq coverage136x (Rank: top 55%)
Annotation
HeliconiusHMEL0225696e-6838.67% 
BombyxBGIBMGA014401-TA2e-3556.59% 
Drosophilarump-PA2e-0841.18% 
EBI UniRef50UniRef50_E9GZG14e-1440.38%Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9GZG1_DAPPU
NCBI RefSeqXP_974319.13e-1341.84%PREDICTED: similar to Hrp59 CG9373-PA [Tribolium castaneum]
NCBI nr blastpgi|3214641332e-1340.38%hypothetical protein DAPPUDRAFT_323682 [Daphnia pulex]
NCBI nr blastxgi|850678142e-1639.78%CRX like homeobox 2 [Homo sapiens]
Group
Gene OntologyGO:00001668e-08nucleotide binding
KEGG pathway 
InterPro domain[559-616] IPR0126778e-08Nucleotide-binding, alpha-beta plait
Orthology groupMCL23634 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209704-TA
ATGGATTCAAATGAAAATAACCAAAGTGACCAGGTGAAAGCTTTGGAATCATCGGTATTAAAATCAGAAGCTGTGAAAGAAGAGCCAGAAGACAATGAAAATTCTGAACTCTCAAATTATAACAACGTTGAAAAAATAGATGTTGATGCTAATCCGAGTAATTTACCATCAAATGGCCAGAAACGTAAGAGCCCCGAGAAATCATTGCCGGAGAACGATTGTAATGATAACCCAAATAGCATAAAAAGAAGGAGGTATACTGACACGCAAACTGAGGATCAGGTCGACCCTAGTATAGAAGAGGAGCCAAAGTTCTCTTTGATAGTGAGGAATGCAAGTTGTGACTGGACATCACAACAGGTTCTCAACTTCATCAGGGATACTTTTCGCTGTGATAAATCAGAGTTGAAGGAAGTATTGGAGTTGGCAGGTAGGGTAGTAATGTGCTCTGTTGTCACATCTCTGCACAGATATGCCAATGCTATGTACAGCCACCCTCTGGAAGCTGTTCAGGCAGTGTCAATGCTTAATGGGCAAATATTCTACGGGAAAAAACTTAAAGTCACTATAAACAGATCCCCTAACGTTAAAACCTTGTTGCCCAAAGGTCTTGTAAGTGTAGGGCCGGGCTTAGGCAAATTCGGAAAACCGGTTCGAGATTTACCTGATCAATACAAAAGATTCATCCAAGGACAAAACTCTGCGATTGATGCGAGTTTGTTCCAACCAGACCTTCTCAATAGAATCGGGGTGACCCTCGATCGGGACATTAATTATTCACGTTCCTCCGGGACTCCTTATAGTCAAAGTAGCCAAACCGATTTATGCACAAGAAACAGTGCTGATTCCAGTGAAAGTTTGTCCCAAATTTCGCACATGGATGAAAATCAATTTGGTCCTATCGGACAAAAACCGAGTTTCCATAATACGCCTTTGACGAATTTTACTGGGCAAAACTGTTCTGTCAATATTAATTCTAATGGCTCAAATATTTCAGCGAATCGAATGCCTGTCATGACTACCAGGGATCCTAGAAGCAACCAGAATTACATTTGTCCCCCAGTATCAAACGATTCTGATAGACTTGTTATTACCTCTGCTGTTATAAATCCCCAACATCCCGTTTCGGGGCAAATCAGAGGACCGGGTGCTATTCAGAATGCGGGTCGTGTTTTGGGGATAGCCAGTTCTGGGTCTATACCAGGTTCGGGTCTAATAAATTATCCTGGACCTATGACAAACTCTGGTCCGATTTCTGGACCTAACCCCATGCCAGGGCTCGGACCTCTGTCTGCTTCTGGTCCTATGCCTAGGCCTGGTGTTATTCGCGGTCCCATGTCTATGCCTGGCCCTATTTCAGGTCCTGGACCTTTATCAGGACCCGTTCGCGTCCCTGGTCCTGGAAATCTTCAATTAAGACATAGTCCTATTTCTGGATGTGCAAGACCTATGGCTCAAGGTCATCCTTCTGGACCATTACAATTAACAGGTCCAAGATGCCCACGACCCAATCAAATGGTTCATATTTCAATAGGCCCTAGAGTACCAGCACCATTAAATAGATTCCCCACTCCTATAAGTCCTATGGCATCAAGGCCTTTGGCACCTAGCGCACCTGTTCAGAACGCGATAAGACCAAATGGACCACGTCTGATGACCACAAACGTTCAAATGTTCAGAAGCGATCCGGTCACACTACAAATAAGCAACTTGCCACCGAATACGAATTTCCTGAATCTCGGTCACAAATTATCTGAACTAGGTCACGTAGTGTACTTGGAGTTCACAACACCTGGATGTGCTGTTGTGAGGTTCGCGAACCCAGCTGATGCTGATAGATGCTTTCGTATCCTTTGTGTATAA

Protein sequence:

>DPOGS209704-PA
MDSNENNQSDQVKALESSVLKSEAVKEEPEDNENSELSNYNNVEKIDVDANPSNLPSNGQKRKSPEKSLPENDCNDNPNSIKRRRYTDTQTEDQVDPSIEEEPKFSLIVRNASCDWTSQQVLNFIRDTFRCDKSELKEVLELAGRVVMCSVVTSLHRYANAMYSHPLEAVQAVSMLNGQIFYGKKLKVTINRSPNVKTLLPKGLVSVGPGLGKFGKPVRDLPDQYKRFIQGQNSAIDASLFQPDLLNRIGVTLDRDINYSRSSGTPYSQSSQTDLCTRNSADSSESLSQISHMDENQFGPIGQKPSFHNTPLTNFTGQNCSVNINSNGSNISANRMPVMTTRDPRSNQNYICPPVSNDSDRLVITSAVINPQHPVSGQIRGPGAIQNAGRVLGIASSGSIPGSGLINYPGPMTNSGPISGPNPMPGLGPLSASGPMPRPGVIRGPMSMPGPISGPGPLSGPVRVPGPGNLQLRHSPISGCARPMAQGHPSGPLQLTGPRCPRPNQMVHISIGPRVPAPLNRFPTPISPMASRPLAPSAPVQNAIRPNGPRLMTTNVQMFRSDPVTLQISNLPPNTNFLNLGHKLSELGHVVYLEFTTPGCAVVRFANPADADRCFRILCV-