Monarch geneset OGS2.0

DPOGS200290
Transcript: DPOGS200290-TA (2244 bp)
Protein: DPOGS200290-PA (747 aa)
Genomic position: DPSCF300026, 528601-532280
RNAseq coverage: 205x (Rank: top 46%)
Annotation
Heliconius: HMEL000041 (E-value 0.0, 72.56%)
Bombyx: BGIBMGA005637-TA (E-value 5e-105, 70.99%)
Drosophila: pen-PA (E-value 5e-70, 34.52%)
EBI UniRef50: UniRef50_D0AB89 (E-value 0.0, 72.56%) Putative penguin n=13 Tax=Nymphalidae RepID=D0AB89_9NEOP
NCBI RefSeq: XP_971048.1 (E-value 4e-113, 42.28%) PREDICTED: similar to GA14176-PA [Tribolium castaneum]
NCBI nr blastp: gi|261335951 (E-value 0.0, 72.56%) putative penguin [Heliconius melpomene]
NCBI nr blastx: gi|261335951 (E-value 0.0, 72.56%) putative penguin [Heliconius melpomene]
Group
Gene Ontology: GO:0005488 (E-value 1.3e-44) binding
Gene Ontology: GO:0003723 (E-value 3e-23) RNA binding
KEGG pathway 
InterPro domain: [272-587] IPR016024 (E-value 1.3e-44) Armadillo-type fold
InterPro domain: [282-588] IPR011989 (E-value 7.5e-38) Armadillo-like helical
InterPro domain: [543-678] IPR012959 (E-value 3e-23) CPL
Orthology group: MCL14860, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200290-TA
ATGCAGAAATTAAAAAGAAAAAATGGCGATGACAATACTTCACCTTTGAAAAAGAAGAAAGTTCAGTTTAGCGAACCCACAAAAGATGGAAAAGGTAAAAATAACGAAAATAATAAGCCGTTTAAGAAAGGGGATTTAAAAAATAAGGAAATAAATAAACCGTTTAAGAAAGGGGATTTAAAAAATAACGAAAATAATAAACCGTTTAAGAAAGGGGATTTAAAAAATAACGAAAATAATAAACCGTTTAAGAAAGGGGATTTAAAAAATAACGAAAATAATAAACCGTTTAAGAAAGGGGATTTAAAAAATAACGAAAATAATAAACCGTTTAAGAAAGGGGATTTAAAAAATAACGAAAATAATAAACCGTTTAAGAAAGGGGATTTAAAAAATAACGAAAATAATAAACCGTTTAAGAAAGGGGATTTAAAAAATAACGAAAATAATAAACCGTTTAAGAAAGGGGATTTAAAAAATAACGAAAATAATAAACCGTTTAAGAAAGGGGATTTAAAAAATAAGGAAATAAATAAACCGTTTAAGAAAGGGGATTTAAAAAATAACGAAAATAATAAACCGTTTAAGAAAGGGGATTTAAAAAATGGAAAGAAGTTTTCTAAACCAGGATTTAAAGGTAAAGCTGGTTGTACAAAAAATACAAAGCTATCTTCAAAAGACGGCAAAGATGAAAAACCAAAATGGTCAGAAATGAAAAAAGAGAAAAAGAATTTACGATTAGAGCGACGTAAAGCTAAATCAACTGCAGAAATATTTGAAATTTCTAATAAAGCTAAATTATTAGCCGCACAAATTCAAAGAAAGGTGATAAAGCCTGATTTCAGAAGTAGTGCATGTAAAGAATTACACGCTCTCATTAAAGGCCAATATAAAGCTATTGCACTGACCCATGATCTTAGCAGAGTTATACAAGTTTTACTCAAGCATAGCTCAGATGAAATTAAACATGAGATAACAGAAGAACTGATGGACATCATGGCAACAATGATGCAATCAAAATATGCTCATCACTCGGTGAAGCGCATCCTCAAGTATGGCACCGATGCAATAAGACACCAAGTAATTAAGAAACTATTGGGCAATATTGTGTCTTTAGCTTCACATTCTATAAGTGCACCGGTACTAGATTATGCATACGGTGAATTTGCATCCAAAAAGGAAAAGATGCACATGCAACAAGAGTTCTATGGTGAAATGTATAAAAACACAAAAGATGATAGAGTGAAGACACTGAGTGACACTTACAAAGACAGTCCTGAAATGAAAGCTGCTATTTTGCAGTCATGTAAAGCGAACATACAACGCATTTTGGACAAAAATTTACATGACAGCGAATTACTGCATTCTGTGTTGTATGACTACATAAGAGAGTGTAGTAAGGAGGATCAAACGGAACTTATATCAAGTCTGAGTCCATTGATTGTACCTCTCAGTAACTCCCTGCCTGGAACTATATTGAAAGTAGTTAAGGAACATGTGGTGCCGCTCAGTAAACACAAAACAGGCTATAGACTCCTTATAGTGATATTCGATTCCGTCGATGACACAGTGTTAGTTAAAAAGGCAATCGTCTCAACTCTCGTCAGCAACCTGAAGGACATTGCAAGGGACCATTGGGGAAAGATGACATTACACTGGCTTGTAAAGCCTAAAGATTCGGCAGCATTCCACCCAACCTTTATAAAGTTCTTGGAGGAAGGACTTAAAACTGGAACTTCAAAGAAAGACACTGAAATTCGGGTTTCGGAGTTGAGAGAGCTGATTCTCCCCGCCATAAAGAGTGATATCGAAAATGATCCTGAATTCTGGCTGAAGGACAAAGCAACTTTACTGTTAACAATAGCTGTTTTATCTATTGATCACTCTAAAAAGGCATTGGAAGAACTCGCCAAAGTTATCTGTAAAGTAGATTGGACTATAACAAACAATGACAATAGCATACTAGCGATAGAAGACGCCGGGATGCACATGTGTTTGAA
AAAACTAGCAGCTTTGGACAAAGATGCCGAAGAGTCACTCGGAAACGTTATTTGTGATAACATTGAGGATGAAACTTTGAAAATGTGGTTGGCTACAAATCGTGGATGTTTTTTCATTGTAAAACTCATAGAAAATAATGGAGAAAGTACATCAAAAAAATGGATAAAAAAGTTAAAACCTCATTCGAAATTACTGAAAGCGCAGTCCTCTGAAGGAGCCAAAATCCTTTTACAGAGTTTGTAA

Protein sequence:

>DPOGS200290-PA
MQKLKRKNGDDNTSPLKKKKVQFSEPTKDGKGKNNENNKPFKKGDLKNKEINKPFKKGDLKNNENNKPFKKGDLKNNENNKPFKKGDLKNNENNKPFKKGDLKNNENNKPFKKGDLKNNENNKPFKKGDLKNNENNKPFKKGDLKNNENNKPFKKGDLKNNENNKPFKKGDLKNKEINKPFKKGDLKNNENNKPFKKGDLKNGKKFSKPGFKGKAGCTKNTKLSSKDGKDEKPKWSEMKKEKKNLRLERRKAKSTAEIFEISNKAKLLAAQIQRKVIKPDFRSSACKELHALIKGQYKAIALTHDLSRVIQVLLKHSSDEIKHEITEELMDIMATMMQSKYAHHSVKRILKYGTDAIRHQVIKKLLGNIVSLASHSISAPVLDYAYGEFASKKEKMHMQQEFYGEMYKNTKDDRVKTLSDTYKDSPEMKAAILQSCKANIQRILDKNLHDSELLHSVLYDYIRECSKEDQTELISSLSPLIVPLSNSLPGTILKVVKEHVVPLSKHKTGYRLLIVIFDSVDDTVLVKKAIVSTLVSNLKDIARDHWGKMTLHWLVKPKDSAAFHPTFIKFLEEGLKTGTSKKDTEIRVSELRELILPAIKSDIENDPEFWLKDKATLLLTIAVLSIDHSKKALEELAKVICKVDWTITNNDNSILAIEDAGMHMCLKKLAALDKDAEESLGNVICDNIEDETLKMWLATNRGCFFIVKLIENNGESTSKKWIKKLKPHSKLLKAQSSEGAKILLQSL-