Monarch geneset OGS2.0

DPOGS205442
TranscriptDPOGS205442-TA1230 bp
ProteinDPOGS205442-PA409 aa
Genomic positionDPSCF300332 - 108558-113438
RNAseq coverage190x (Rank: top 48%)
Annotation
HeliconiusHMEL0147180.094.38% 
BombyxBGIBMGA009293-TA0.090.49% 
Drosophilaking-tubby-PB2e-13554.51% 
EBI UniRef50UniRef50_Q86PC93e-13354.51%Protein king tubby n=17 Tax=Diptera RepID=TULP_DROME
NCBI RefSeqXP_974118.27e-14865.62%PREDICTED: similar to king tubby CG9398-PB [Tribolium castaneum]
NCBI nr blastpgi|2700057044e-14765.38%hypothetical protein TcasGA2_TC007804 [Tribolium castaneum]
NCBI nr blastxgi|2700057045e-14164.41%hypothetical protein TcasGA2_TC007804 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[166-403] IPR0000075.5e-107Tubby, C-terminal
Orthology groupMCL11803 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205442-TA
ATGGCTTCGATAAATGTCCGGGATCAAAAAATCGAACAGCAACGACAATTCATGGAACAGAAAATGAAACAAAAGAGGCAGAATTCTGGTATGGTTCAAGCGAATGATCTTAGGGTAGGTTCTGCTAAGCGACCAATTTCTGGAAGCAGAACACGGGAACTGCATGGCTATGATGGTCCAATGCAATTTCTAATGTCGCCAGTGAATCCAGATCAAGTTATACCCCTCCAGACTAATAGAATGTCTACTTATGATGAATTAGGTAATCAGATTGAAGTTCTAACAGTGGGTGATGATATTGGGGAAGGTAGTGGAGAGGAAGAGGGTGAGAGTGTTCCGGTTTGTAGCATGGGACGGGACGTTAGTACAGATGATGTCTGTGCTGATGCCGCAGTGGCTCCCCTTCAAGGTCGGCCAAAACGGGATACATCTCCCAGTCAGACGGCTGAGATCGAGGGATCAGTTGAAGGTTCCGTGGAAACGTTTGTCATAACTCCAGCGAAACATGGCACGCTATACAAATGTAGGATAGCGAGGGACAGAAAGGGAATGGATAGAGGCTTGTATCCGACGTACTTCTTACACCTAGAGAAGGATTACGGGAAGAAAGTTTTTCTTTTAGCTGGTCGAAAGCGGAAGAAGTCGGCTACATCGAACTACCTCATATCAACCGATCCCACAGAATTAACGCGTCAAGCTGATAGCTTCGCGGGGAAATTGCGTTCCAATTTACTCGGTACGGCTTTCACTGTCTACGATAATGGGAAAGCATGGAGGAAAAACCATAGGGACCCACCGCGCCATGAACTAGCAGCTGTCATTTATGATACGAACGTGTTGGGTTTCAAAGGACCTCGCAAAATGACAGTCATCCTTCCTGGAATGACTCAAGATCGTCAAAGAGTTACAATAGCACCGCAGGACGATAGCGAGACGCTGTTAGAAAGATGGAAAAGTCAAAATTTCGATGACATCGTGGTCCTTCATAACAAAACGCCCGTTTGGAACGACGAAACACAATCGTACGTACTAAATTTCCACGGCAGGGTCACACAGGCGAGTGTTAAAAACTTTCAGATAGTACATGACTCGGAACCGGACTATGTTGTGATGCAATTCGGTAGGATATCTGAGGATGTCTTCACTATGGACTTCAGATATCCCCTCTGTGCATTGCAAGCCTTCGGTATAGCGCTCAGTTCGTTCGATAGCAAATTGGCTTGCGAGTAA

Protein sequence:

>DPOGS205442-PA
MASINVRDQKIEQQRQFMEQKMKQKRQNSGMVQANDLRVGSAKRPISGSRTRELHGYDGPMQFLMSPVNPDQVIPLQTNRMSTYDELGNQIEVLTVGDDIGEGSGEEEGESVPVCSMGRDVSTDDVCADAAVAPLQGRPKRDTSPSQTAEIEGSVEGSVETFVITPAKHGTLYKCRIARDRKGMDRGLYPTYFLHLEKDYGKKVFLLAGRKRKKSATSNYLISTDPTELTRQADSFAGKLRSNLLGTAFTVYDNGKAWRKNHRDPPRHELAAVIYDTNVLGFKGPRKMTVILPGMTQDRQRVTIAPQDDSETLLERWKSQNFDDIVVLHNKTPVWNDETQSYVLNFHGRVTQASVKNFQIVHDSEPDYVVMQFGRISEDVFTMDFRYPLCALQAFGIALSSFDSKLACE-