Monarch geneset OGS2.0

DPOGS211205
TranscriptDPOGS211205-TA1071 bp
ProteinDPOGS211205-PA356 aa
Genomic positionDPSCF300007 + 876424-878901
RNAseq coverage401x (Rank: top 30%)
Annotation
HeliconiusHMEL0124561e-10465.47% 
BombyxBGIBMGA003187-TA1e-7648.23% 
DrosophilaCG12129-PA3e-4433.80% 
EBI UniRef50UniRef50_A7UR961e-7141.30%AGAP007571-PA n=6 Tax=Culicidae RepID=A7UR96_ANOGA
NCBI RefSeqXP_001848897.11e-7239.94%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700423612e-7139.94%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|1582854262e-6840.11%AGAP007571-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00063555.6e-25regulation of transcription, DNA-dependent
GO:00057375.6e-25cytoplasm
GO:00037234.2e-12RNA binding
KEGG pathway 
InterPro domain[3-354] IPR0092105.6e-25Predicted eukaryotic LigT
[65-124] IPR0181114.2e-12K Homology, type 1, subgroup
[61-129] IPR0040874.8e-11K Homology
[131-288] IPR0195101.5e-09Protein kinase A anchor protein, nuclear localisation signal domain
Orthology groupMCL12841 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211205-TA
ATGAATGATATCCTGAGACCGGAATTGCTTTGGATAGAGGGAAGGTGCTATAGAGTGCATGATCCAGCGACGGAAATAACAGCCTTTCAGGAATACGATTTATACGAAAACGATACTCCACTAAACGAAGTGGATGAAGACGATGATGACACAGCTTATGAGATAGTTATGATAGATAGCAACAGATATACAACTAATATCCACGTTCCAAGGCATTACATTGGATCAATAATTGGTAAAAAAGGTGCAACAATCAGTCGAATTGGTAGAGATACGAAAACTGTTATCAAAATACCTCGACACGGGGAAAATTCAGATATATCTATTTTCGGACCCAGTATTACTAATGTTAAAGCAGCTATACGGCGAATCAACATTATTGTGATGGCTGCAAGAATGAAGCAGAAACCCACACACTTCATATCCATACCAATGAATGCGGCTAATATAGTGGAAGGCTTCGAAAGATTCAAAGTACGAGTCCTTCAGGATTGTCCGAATGTAGATGAGTCTTTGTTCATTCGATCTACAAAACTACATATAACATTAGGAGTTATGTGTTTAATGGACAATGAAGAACGTCAGCTGGCATCAAAACTGTTATTGGAGGCAAAAGATAAATGTATTATGCCAATAGTGAAAGACTTTCTGCCATTGAAAATCAGATTAAAAGGTTTATCATATATGAATGATGATCCTAAGGCTGTTGACGTGTTATATGGCTGTGTCGAAGAAGTGGACGCGCCTTCAGGAATATTACAACAATTAGTTGACTCTATATTTAACCACTTTAAAAACGCAGGTTTAATGCACAGTTCTCAGAATCATGAAATCGATAATGTTAAGATGCATGTAACTTTGTTGAATTCCAAGTACAGGCAGAGACAACAAAATTCAGATTTAAACGATAACAAACATAAAAGGGAGACTTTCGATGGATCTGATGTTTTGTTGAAATTTTCCAATTATGACTTCGGTGTTACGGAATTGCGTGACGTGCATTTATCCCAACGTAATACTTCGGGACCGGACGGATATTACCTGTCAACTGTCATTATATCTGTCAACTGA

Protein sequence:

>DPOGS211205-PA
MNDILRPELLWIEGRCYRVHDPATEITAFQEYDLYENDTPLNEVDEDDDDTAYEIVMIDSNRYTTNIHVPRHYIGSIIGKKGATISRIGRDTKTVIKIPRHGENSDISIFGPSITNVKAAIRRINIIVMAARMKQKPTHFISIPMNAANIVEGFERFKVRVLQDCPNVDESLFIRSTKLHITLGVMCLMDNEERQLASKLLLEAKDKCIMPIVKDFLPLKIRLKGLSYMNDDPKAVDVLYGCVEEVDAPSGILQQLVDSIFNHFKNAGLMHSSQNHEIDNVKMHVTLLNSKYRQRQQNSDLNDNKHKRETFDGSDVLLKFSNYDFGVTELRDVHLSQRNTSGPDGYYLSTVIISVN-