Monarch geneset OGS2.0

DPOGS201344
TranscriptDPOGS201344-TA1299 bp
ProteinDPOGS201344-PA432 aa
Genomic positionDPSCF300176 + 731320-734176
RNAseq coverage79x (Rank: top 64%)
Annotation
HeliconiusHMEL0123917e-16462.50% 
BombyxBGIBMGA003127-TA1e-8740.77% 
Drosophilal(2)37Cd-PA4e-4331.33% 
EBI UniRef50UniRef50_B0WIR24e-5333.80%Putative uncharacterized protein n=2 Tax=Culicinae RepID=B0WIR2_CULQU
NCBI RefSeqXP_001653791.13e-5433.56%hypothetical protein AaeL_AAEL009331 [Aedes aegypti]
NCBI nr blastpgi|1571229725e-5333.56%hypothetical protein AaeL_AAEL009331 [Aedes aegypti]
NCBI nr blastxgi|2420139041e-5233.17%conserved hypothetical protein [Pediculus humanus corporis]
Group
KEGG pathway 
InterPro domain[3-279] IPR0191363.4e-45Transcription factor IIIC, subunit 5
Orthology groupMCL14446 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201344-TA
ATGGGAGGTATAAAGGCTCTTTCACAGCATTACACCCAAGCTAATAAAAAGCGACTGGGATTTAGCTTTCAACCAGATAATCCTTTTATGAAGAAAATATATGCTGACGCAAAACCCACCGCCGGTGTCCTCTTTAAACTAAAGGTTAAGAAAACTAAATCGGGTAATGAGGTTAAGAAAGAAGTAATTTCAACATCTATTGTTGGAACTGTTAAGAAAATTAATAGGTTTGAATCAATGTGTGACTTCCAATACTTACCACTCAGTACACCACACATAGAAGGTGACAAACCACAATGTCTTATAGAACAAATCATACCCTCCGGCCTAGATGAGCTGAATTCCATATTGGAACCCACGCCTCTCTTTATAACACCATCAAATTTTACGAGGTCAGACAAACCGATAACATACTGCTACACAGAGAAACGCTATGTGACAAAGGATATGATGAAGGGCGAGTCCACAAATGACGAAGTACATAAGACAAGGATGGAGAGGTCTTTGCATTTACCGAGATTTATATTTTCACTGAATGAAGAGTTACCAACTGAACCCAATGAATATTATATTAAATTGAGAAATGCAAGACAAGCTCTGAATCCATCTTTAGAAGAGGAATACAATACAGTGGCAAAGCTCTTCGAAGAGAGACCGATATGGTCATTGAATCTAGTCAAGTTTCATACAAAGATAAAGCTGTCATCTCTTAAGGTGATAATGCCGTGTCTTGCATTGTACATGAGAGAAGGTCCGTGGAGGATGCTCTGGACAAGATTTGGATACGATCCAAGAAAAGAACCCGGCTCAAGGATTTACCAGACCCTGGACTTTAGGATGAGACATGCAGCCGGTGTTCACTCTATGGTGTCAACTCGTGATGAGTTCGTTCATTGCAAGAAGAAAGATAGAATTAAGAATTTAAGCAAAAGTGCTATAGATGACCTCTCAATAGAGGACACGGTTTACGAAGGCGCTGTTTACTTTAGACCCGGGATGGCGCCAACTCAGAGACAAATATACTACCAGTACTGTGACGTGTACCTGCCGGAGGTCCAGGAGCTCGTGTCTCTGTCCCCGCCAGCCGGGTACACGTGTCACGAGCGCCGTGGCTGGCTGCCTCCCGACACTGACCAGCTCTGTCGGGACCACATCTTCAGATACGTCATGCAGACTCTACTAGCGAATCGCGTTAAGTATGAGGACGGGGTTGGTACAGGCGGCGAGAGTAGCTCTGATGACGCGGACGAAGCAGCGAATGCTTCTGTCGCTGAAGTTGATGAATCCATTAATACATGA

Protein sequence:

>DPOGS201344-PA
MGGIKALSQHYTQANKKRLGFSFQPDNPFMKKIYADAKPTAGVLFKLKVKKTKSGNEVKKEVISTSIVGTVKKINRFESMCDFQYLPLSTPHIEGDKPQCLIEQIIPSGLDELNSILEPTPLFITPSNFTRSDKPITYCYTEKRYVTKDMMKGESTNDEVHKTRMERSLHLPRFIFSLNEELPTEPNEYYIKLRNARQALNPSLEEEYNTVAKLFEERPIWSLNLVKFHTKIKLSSLKVIMPCLALYMREGPWRMLWTRFGYDPRKEPGSRIYQTLDFRMRHAAGVHSMVSTRDEFVHCKKKDRIKNLSKSAIDDLSIEDTVYEGAVYFRPGMAPTQRQIYYQYCDVYLPEVQELVSLSPPAGYTCHERRGWLPPDTDQLCRDHIFRYVMQTLLANRVKYEDGVGTGGESSSDDADEAANASVAEVDESINT-