Monarch geneset OGS2.0

DPOGS209065
TranscriptDPOGS209065-TA1410 bp
ProteinDPOGS209065-PA469 aa
Genomic positionDPSCF300102 + 279229-283708
RNAseq coverage1874x (Rank: top 7%)
Annotation
HeliconiusHMEL0052752e-18068.96% 
BombyxBGIBMGA010057-TA5e-14158.11% 
DrosophilaCG1785-PA3e-5334.48% 
EBI UniRef50UniRef50_D6WA296e-6741.72%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WA29_TRICA
NCBI RefSeqXP_966971.13e-6239.09%PREDICTED: similar to Uncharacterized protein CG1785 [Tribolium castaneum]
NCBI nr blastpgi|2700016232e-6641.72%hypothetical protein TcasGA2_TC000477 [Tribolium castaneum]
NCBI nr blastxgi|2700016235e-8242.23%hypothetical protein TcasGA2_TC000477 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[14-426] IPR0116871.2e-66P60-like
[1-466] IPR0112117.4e-37Tumour suppressor protein Gltscr2
Orthology groupMCL12926 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209065-TA
ATGACTGTAAATACTGCAATAAAGAAAAGAAAACGCGTGTCTAAAAAGAACAAAGCCTCGTGGCGTAAACATTGTGATATAAATGATGTCGAAGAATTCCTTGAAGACCAGAGATTAGAAGAAAGATTAGGGAAATTTGATCAGAAACCAGATGAAGAACTATTTATTGTAGACACAACTGGTGGTGATGTTGACGAGAAGCCGGAAGTCAAACCCGATCTTAAAGCTAAATCCTTTAAGGAAAGGAAAAGAGCCCAGTTGGCTGAAACACCAAAATGTTTTGAAGTGCTTTTACCTACATCTAAGGTTCAAGACCCTAACAAAAAGCGTAATACAGTCAAACAAGTTGGCTCCAAGCCATCAGATTTGTCATTATTAACAGAAAAGCGACAATTAGCTAAAGGTGTCCTCAGCAGAAAAGTAGAACAGTCTAAGGTTAATAGGAAACTAGCAATACAGAAGAAGAAAAAGTCCAAGACTGTCAGGCAGAGCTTTGATAAGAACATTTGGGACGCTCCAACTCTGGAATCCAAGGGTATACCGGAAACTCTTTGCAATGAATTTGTATCGACCGAGGCTCAGTTACATAATGTACCAACAACTAAACGCCTGCGAGCCAAGCCGCAGCTTCCAAAGACGCTGCTAACGAGAGCGGCCATCGATGTACCACATCCCGGTGTCAGCTACAATCCTTCATTCCAGGAACACCAAGCCCTGTTGTCGGAGGTGGTCCAACATGAACAGAAAATGATGAAGAGGCAGGCTCATCTCAATAGAGTCACTACAAACATGTTCAGTAAAGTGACGCAGGGTGAAAAGGATAAACAATGGCAGGAGGAAATGAGTGTAGGGTTGCCACAACCCCACAATCCTTCAAATGATCCCGATCCGGAGCCCTCGGATAATGAATATAAAGCAATCAACCCTCCAGTCAAGAACAAGAAAAAGGATCACAAAGCAAGAAGGAAACAGAGGGAACAATTGGAAGAAAAAGAAAGGTTGAAAAGAGCCAAGATTGAGAAGAAAAAAATCACCGATTTATATAGGCTCCGTAAGATCCAAGAGTCCCTCAAGAAGCAGGAGTCCCGTCAGTGTGAGAAGTCCACACGCCTATCTTCACGGCGTCAGGAAAAAGCTGCCACGGCACCACCAGCACTGAACAAACATCGACCACCTGAGAAAGAACCGGAGTTTGTAGACCCTGCCATACTGACCGGGGACCTCAGGAACCTGACCAATACCAGCAACCTCCTCCGAGATCGCTTTGAGTCTCTCCAGCGCCGCGGTGCTCTCGCCGCTTCAAAGGTCATGATGAAGAAGAAACGGAAAGTCAAGAGTTACTTTAAACCGGGTCACAAGGTCACCGAGCAAGACATCAAAAAATATATTAACAAACTAGGCAAGAAGTGA

Protein sequence:

>DPOGS209065-PA
MTVNTAIKKRKRVSKKNKASWRKHCDINDVEEFLEDQRLEERLGKFDQKPDEELFIVDTTGGDVDEKPEVKPDLKAKSFKERKRAQLAETPKCFEVLLPTSKVQDPNKKRNTVKQVGSKPSDLSLLTEKRQLAKGVLSRKVEQSKVNRKLAIQKKKKSKTVRQSFDKNIWDAPTLESKGIPETLCNEFVSTEAQLHNVPTTKRLRAKPQLPKTLLTRAAIDVPHPGVSYNPSFQEHQALLSEVVQHEQKMMKRQAHLNRVTTNMFSKVTQGEKDKQWQEEMSVGLPQPHNPSNDPDPEPSDNEYKAINPPVKNKKKDHKARRKQREQLEEKERLKRAKIEKKKITDLYRLRKIQESLKKQESRQCEKSTRLSSRRQEKAATAPPALNKHRPPEKEPEFVDPAILTGDLRNLTNTSNLLRDRFESLQRRGALAASKVMMKKKRKVKSYFKPGHKVTEQDIKKYINKLGKK-