Monarch geneset OGS2.0

DPOGS215340
TranscriptDPOGS215340-TA1134 bp
ProteinDPOGS215340-PA377 aa
Genomic positionDPSCF300120 + 413363-426304
RNAseq coverage500x (Rank: top 25%)
Annotation
Heliconius% 
BombyxBGIBMGA007975-TA2e-7877.30% 
DrosophilaCG8671-PA3e-5642.32% 
EBI UniRef50UniRef50_D6WB472e-6448.09%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WB47_TRICA
NCBI RefSeqXP_968107.11e-6549.69%PREDICTED: similar to Protein FAM102A (Early estrogen-induced gene 1 protein) [Tribolium castaneum]
NCBI nr blastpgi|910762783e-6449.69%PREDICTED: similar to Protein FAM102A (Early estrogen-induced gene 1 protein) [Tribolium castaneum]
NCBI nr blastxgi|3454799059e-6748.97%PREDICTED: hypothetical protein LOC100121584 [Nasonia vitripennis]
Group
KEGG pathway 
InterPro domain[13-150] IPR0194489.6e-22Oestrogen-responsive protein Fam102A/B, N-terminal
Orthology groupMCL15630 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215340-TA
ATGGCGTTCATGACCATGTCCAAGAAGAAGCGATACAAGTTCGGGGTGCAGTGCTGCCTCGAGGAACTGACCGAGGTGCCGTTCGTGTCGGCGGTGCTGTTCGCTAAGGTGCGCCTTCAGGACGGAGGGACCTTCCAGGATCACTCCAGCAGGGAGGAGGTGAGGAACCACGCCGTCCGGTGGAACGCTCAGTTCTCGTTCGTGTGTAAGATGTGCGCCAACGCCAACACCGGGGTGTTGGAGCCAGCGAGGCTCAGGGTCTCCGTCAGGAAGGAGTGCAAGGGTGGACGATCCTACCAGAAGCTGGGCTTCTGCGATGTGAACCTGGCGGAGCTGGCGGGCGCCGGGGAGACGGTCCGCCGCTGTCTGCTGGAGGGATACGACCCTAACAGGAGGCAGGACAACTCCGTGCTCAGGATACGAATCAAGATGAACATGATCTCAGGGGACCCGCTCTTTAAAGTACCTGAGAGGAAACAGGAAGCGGCAGAGAAGACGGGCGCGGACAGCGGGTCGGAGAGCACCGCGCCCGCCGACGACGACGCCGCCTCTTCGGGCACCAGCTCGGGCTTCGGGTCCCTCACCAAGAAGAAGAACTACGAAGGTGTGACGCCAGCACTGTCGTCTCTACCGTCCTGCGAGCTGCCCTCGCCGGAGGGAGAGGAGCTGCCACCTGTACCAGTGCTGCCCCCGCCCGCCCCCGCTTCCGCCGCCGCCCCCGTCCCCGTCCCGACGACCATCAGCGACTACCCCGGAGTGGTGGTAGTGACGACTCAGCCGGGCGCGGGGGCTGCGGGGGTCGCGGGCGCCGCCGGGGGATGTGTCGCCTGTCTGCAGCACACTCACTCGAGGAACTCCTCCAACACCTCGGGGGACATGAGCAGCAAAGCCTCGGGCTACGGCAGCTCGGTGTCGGCGGCGTCCGCACACTCGAGACAGAGCTCCGAGGGGGAGTCGACGGGGGACTGCAGACCCCACCACAACAGGTACGTCGATAACACACACTGCGGGCCGCCGGGGTCACACCCCGCGACTCGCAGACGGTTACTGTTAAATGAAATTAAAAATATAAACCTTCGGCTGGAGGGCGGCACGAGAGGGACGAACCTCGTGACCGAGGGCGACACGGCGTGA

Protein sequence:

>DPOGS215340-PA
MAFMTMSKKKRYKFGVQCCLEELTEVPFVSAVLFAKVRLQDGGTFQDHSSREEVRNHAVRWNAQFSFVCKMCANANTGVLEPARLRVSVRKECKGGRSYQKLGFCDVNLAELAGAGETVRRCLLEGYDPNRRQDNSVLRIRIKMNMISGDPLFKVPERKQEAAEKTGADSGSESTAPADDDAASSGTSSGFGSLTKKKNYEGVTPALSSLPSCELPSPEGEELPPVPVLPPPAPASAAAPVPVPTTISDYPGVVVVTTQPGAGAAGVAGAAGGCVACLQHTHSRNSSNTSGDMSSKASGYGSSVSAASAHSRQSSEGESTGDCRPHHNRYVDNTHCGPPGSHPATRRRLLLNEIKNINLRLEGGTRGTNLVTEGDTA-