Monarch geneset OGS2.0

DPOGS207144
TranscriptDPOGS207144-TA2226 bp
ProteinDPOGS207144-PA741 aa
Genomic positionDPSCF300001 + 3885428-3898590
RNAseq coverage391x (Rank: top 31%)
Annotation
HeliconiusHMEL0121910.071.18% 
BombyxBGIBMGA000579-TA2e-17568.63% 
DrosophilaCG18659-PA9e-11544.06% 
EBI UniRef50UniRef50_D7EJ887e-14748.28%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D7EJ88_TRICA
NCBI RefSeqXP_397549.22e-14855.82%PREDICTED: similar to CG18659-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3504066708e-14956.48%PREDICTED: DENN domain-containing protein 1A-like isoform 1 [Bombus impatiens]
NCBI nr blastxgi|665192752e-14341.82%PREDICTED: DENN domain-containing protein 1A-like isoform 2 [Apis mellifera]
Group
KEGG pathway 
InterPro domain[97-281] IPR0011943.4e-52DENN
[311-376] IPR0051128.1e-21dDENN
[21-94] IPR0051139.7e-15uDENN
Orthology groupMCL11475 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207144-TA
ATGAAACGTCACATTTTAGAAGGTCCAAGTCTAGATATGGGTTCCAGATTAAGAGATACTGTGCGCTTCCTTTTTGAATTATTCTGTGAAGTTTCTCCTGGTGACCAAGTGAGAGAGCCTTACATTATTAGAAAATATCCAGAGTCATACAAAAATGAGGAAGAGTTAAAAAATGTACCTAAGTTTACATTTCCTTGTAAACTGGAAAATACATTCATCCAACATTATTCATTTGTGCTGACTTCGGTGGATTCCAAATATACATTCTGTTTCTGCCGTTACGATCCAAAAACCAACACAGCCCTGGTTTTGCTGTCACATCTGCCATGGCACGATATATTTTACAAGTTGTTGAACTGCATCGCGACCTTGGAGAATGGACCGGAGCGTGGCGAGCTGACCGCGTTCCTAGCGGCGTGTCGCATCCGTCCGCCGGCTCCGGGCCACACGCTGCGGGTAACATACGACGCCGGACGTGGAGCCTTCACTTGCCGCTGTCCTGGAAGCGGATTGCCCAGCATCCCTGACAACTGCAATCTGACGGAGTACTTCAGTGCGCTAGAGGCCGGGCGTATGGCGGCGCTGTGGGCAGCGTTGTTGCACGAGCGGCGAGTGGCGGTGGTCGCATCAAAGCCGGCGAGGCTCGCGGCGTGCGTTCAAGCAGCTAACGACACGTTGTTCCCGATGTCGTGGCAGCATATTTTCATTCCGATCCTGCCGCCCCACTTGGTGGACTACTTGTTAGCACCGATGCCGTTCTTGATCGGTGTTCCTCGGAGCGTGATGGAGACGGTCCGGATGTCGGAGGTGGGGGACGTGGTGGTGCTAGACGCTGACTCCAACGAGCTGAAGAGTCCATTCCGTGATTTGGAAAGTCTTCCTGCGGAGGTTGTGGGTGCCCTTAGACGATCGCTCGCTGACCGCCAGGCGCTCGGTGACGCGGTATCTCGTGCCTTCCTGCGTGCGCTGGTCGCACTAATCGGGGGGTACCGGGACGCTATCAGAATCGAAAAAGGTCAACTTATCACATTTAATCCAGAGGCGTTTGTGAAAACTAGGAAGAATATGCAACCTTTCCTCCGCAAAATACTTCAGTCGCAAATATTTCAACAGTTCATAGATGAACGGCTGGAGTTGTTGAACTCCGGCCGTGGTTTCTCCGACGAATTCGAGCTAGAGTGTACGCGACAGGCGGAGCGGGCGGGTCTGGGGGGGGTGAACACACTCAAACAACAGTACCGCGAGTGGGCGCGCGGGGTGAAGAGGGAGGGGGGTGCGTTCTTCAAAACAGTCAAAGATAAGGCGAATCCCGCGGTACAGTCGGCCGTCAAGACGGTGCGACAGGGCGGTAAGAACATGAAGTCGGCTGTAAAAGGTCTGAAGAATAAAATGCCTAAGACTGGATCACGTCCTTCGTCCATAAACACCACAGATGACAGCAGGTATTCCTGCGGCTCTGGGACGCCGGTGTCATCTGACTCATCGTCTACGTCACGGTCGCCGTCACCTCCCCCCTCGCCGGCCCCCTCCCCCCTGGTACCTCTCACAGTGGTCCCACGTGCTCCGCCGCCTCCAGTACCCTTGCCTCTGGATCTGCTCACGGAGATGGAGCTGTTGTTCCCAGAGCGACGCTCTCCACCAAAAGACACCGATAAGGCGAAGACATTAAATCCACCGCGGTTACCGCAACGCCCCCACCCTCCCCTCAGGTACCCCATCGTATCCACCAGGAAGCTCATAGATCTGTCAGATGCGCCAGCACCCCCGCCGCGCGCTATTCCCGTATCAAACTTTACAACGAATATAACTAAGCACGAGGTCCCAACAGAGTTTCACACTAGCGCCATCCGATTCACGGAGGATAATGATAAGGCAAAACTCAAACTGACTCTATCAGCACATGCGCGGTCGCTCCCCGCACCCGCTCCGCCGCCCGTGCCCGCACCCGTGCCCGCGCCCTCTCGCACCGAGCACCGACCTCGACCCTCCAAGAGCCGGACCCAAGAGTCATGTGGCACAGATCTCATACAGCTAGACGATTCACCGACCGGCCTCGACCATTTCGATCCTCTCAAGTATCGGGATAAAGAGGACAAACAGACTATCAAAAGTTCCGCGGACAGTGCGCTCCTGCATGAGTACGGATTGGACTTCGGTCAGTTCGGTCTCTCTGAGCTGTCTGCTCCCCAAGCCAGCACACACGAGGGATGGACGACCTTCAACTGA

Protein sequence:

>DPOGS207144-PA
MKRHILEGPSLDMGSRLRDTVRFLFELFCEVSPGDQVREPYIIRKYPESYKNEEELKNVPKFTFPCKLENTFIQHYSFVLTSVDSKYTFCFCRYDPKTNTALVLLSHLPWHDIFYKLLNCIATLENGPERGELTAFLAACRIRPPAPGHTLRVTYDAGRGAFTCRCPGSGLPSIPDNCNLTEYFSALEAGRMAALWAALLHERRVAVVASKPARLAACVQAANDTLFPMSWQHIFIPILPPHLVDYLLAPMPFLIGVPRSVMETVRMSEVGDVVVLDADSNELKSPFRDLESLPAEVVGALRRSLADRQALGDAVSRAFLRALVALIGGYRDAIRIEKGQLITFNPEAFVKTRKNMQPFLRKILQSQIFQQFIDERLELLNSGRGFSDEFELECTRQAERAGLGGVNTLKQQYREWARGVKREGGAFFKTVKDKANPAVQSAVKTVRQGGKNMKSAVKGLKNKMPKTGSRPSSINTTDDSRYSCGSGTPVSSDSSSTSRSPSPPPSPAPSPLVPLTVVPRAPPPPVPLPLDLLTEMELLFPERRSPPKDTDKAKTLNPPRLPQRPHPPLRYPIVSTRKLIDLSDAPAPPPRAIPVSNFTTNITKHEVPTEFHTSAIRFTEDNDKAKLKLTLSAHARSLPAPAPPPVPAPVPAPSRTEHRPRPSKSRTQESCGTDLIQLDDSPTGLDHFDPLKYRDKEDKQTIKSSADSALLHEYGLDFGQFGLSELSAPQASTHEGWTTFN-