Monarch geneset OGS2.0

DPOGS214335
Transcript         DPOGS214335-TA   1203 bp
Protein            DPOGS214335-PA   400 aa
Genomic position   DPSCF300020 - 477495-481275
RNAseq coverage    219x (Rank: top 45%)
Annotation
Heliconius       HMEL004782         0.0      80.00%
Bombyx           BGIBMGA004138-TA   7e-168   71.69%
Drosophila       CG11710-PB         1e-102   48.22%
EBI UniRef50     UniRef50_G6D4V4    0.0      94.94%   Putative uncharacterized protein n=4 Tax=Coelomata RepID=G6D4V4_DANPL
NCBI RefSeq      XP_001648636.1     1e-131   57.00%   hypothetical protein AaeL_AAEL014389 [Aedes aegypti]
NCBI nr blastp   gi|157104928       2e-130   57.00%   hypothetical protein AaeL_AAEL014389 [Aedes aegypti]
NCBI nr blastx   gi|157130257       1e-127   57.00%   hypothetical protein AaeL_AAEL011724 [Aedes aegypti]
Group
Gene Ontology    GO:0005634   2.8e-18   nucleus
                 GO:0006355   2.8e-18   regulation of transcription, DNA-dependent
                 GO:0008270   2.8e-18   zinc ion binding
KEGG pathway 
InterPro domain  [243-379]   IPR015947   1.2e-28   PUA-like domain
                 [13-62]     IPR009349   2.8e-18   Zinc finger, C2HC5-type
                 [250-338]   IPR007374   3.4e-16   ASCH domain
Orthology group  MCL11807   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214335-TA
ATGTTTTTCTTATCCTATTTGCAAATCTACCCCGGTCGCCATCACTGTGACTGTCAAGCTTCTAAACATGAGTTGGTTAATAACTGTTTGCAGTGCGGCCGAATTGTTTGCAAACAAGAGGGCTCCGGCCCCTGTCTGTTCTGTGGGAGTCTTGTTTGTACTCCGGAAGAACAAAGAGAGATAAACGCGAAAACAAAGTCTAGCGCCAAACTAATGGAATCCTTAATGGAAAGGAACCGACCAAAAGGATGGGAAGCTGCAATCTCCCAACGGAATAGGCTACTTGAATATGATAGAACTAGCGAGCGTCGCACTCGTGTGACAGACGATGACAGCGATTACTTCAGCGCGAACAGCGTGTGGCTGAGCTCAACAGAGCGAGATAAGCTGCAGGCGTACCAGAAGACGCTGCACGAAAACAAACACGCTTCTAGACTTAACAAGAAGATGACCTTCGATTTCGCTGGTCGTCAAATAATCGAAGAGAATACAATAGACCACGAGGTAGATGAAGACAAGATCAGACAAATTACTAGTAATAATAAAAATAACAGTCAGTTCCTGGCACCGGATCTGTTGCTCGACTCGTCGGCCGACAGAGACGTCGCGCCCGGTGTCAACGCGCCATTATTAAAGTTCGATACATCGGTGGCCGTTCAAAGTAACGTTGGAGTGTCTCAGTCGTGGGGTGTGACAGGCCGCGTTCAGGACGGACAACTGCTTGAGATGAGTGACGCTGGTCGCTGTCTCTCAATGCACCAACCCTGGGCCTCGCTACTCGTGGAGGGAATCAAGATGCACGAAGGTCGTAGTTGGTACACGTCCCACCGCGGCCGGCTGTGGATAGCGTCTACAGTGCGCGCTCCCGAAGACAGCGTTGTACGAGCACTAGAGAACCAGTACAGTGTGCTGTATCCAGACAAGCAAATAAAGTTCCCGTCCTTTTATCCGACGGGATGTCTTCTGGGATGTGTGACGGTTGACGACTGTCTGTCACAGGAGGAGTACGCCAAGAAATACCCAGACGGTGAGAGTGACAGTCCGTATGTTTTCATCTGCTCGAATCCAATAAGTTTACGACTCAGATTTCCAATTAAAGGACAGCATAAGATTTATGCTTTAGATAAAACGATCCATCAGGCGGCAGTGAAATGTATACAGCGAATGGCTAAAATACAAGCGGAGGAAACTCGTTTAGCATGA

Protein sequence:

>DPOGS214335-PA
MFFLSYLQIYPGRHHCDCQASKHELVNNCLQCGRIVCKQEGSGPCLFCGSLVCTPEEQREINAKTKSSAKLMESLMERNRPKGWEAAISQRNRLLEYDRTSERRTRVTDDDSDYFSANSVWLSSTERDKLQAYQKTLHENKHASRLNKKMTFDFAGRQIIEENTIDHEVDEDKIRQITSNNKNNSQFLAPDLLLDSSADRDVAPGVNAPLLKFDTSVAVQSNVGVSQSWGVTGRVQDGQLLEMSDAGRCLSMHQPWASLLVEGIKMHEGRSWYTSHRGRLWIASTVRAPEDSVVRALENQYSVLYPDKQIKFPSFYPTGCLLGCVTVDDCLSQEEYAKKYPDGESDSPYVFICSNPISLRLRFPIKGQHKIYALDKTIHQAAVKCIQRMAKIQAEETRLA-