Monarch geneset OGS2.0

DPOGS206047
Transcript: DPOGS206047-TA  1059 bp
Protein: DPOGS206047-PA  352 aa
Genomic position: DPSCF300028 - 1109300-1112976
RNAseq coverage: 34x (Rank: top 74%)
Annotation
Heliconius: HMEL002817  3e-118  78.36%
Bombyx: BGIBMGA000499-TA  6e-78  69.20%
Drosophila: CG12029-PB  5e-60  89.09%
EBI UniRef50: UniRef50_Q9VZN4  8e-58  89.09%  CG12029 n=4 Tax=Diptera RepID=Q9VZN4_DROME
NCBI RefSeq: XP_002046578.1  2e-61  58.87%  GJ12406 [Drosophila virilis]
NCBI nr blastp: gi|270003156  1e-78  50.42%  hypothetical protein TcasGA2_TC002119 [Tribolium castaneum]
NCBI nr blastx: gi|270003156  2e-92  50.36%  hypothetical protein TcasGA2_TC002119 [Tribolium castaneum]
Group
Gene Ontology: GO:0003676  2.9e-18  nucleic acid binding
               GO:0008270  4.1e-05  zinc ion binding
               GO:0005622  4.1e-05  intracellular
KEGG pathway 
InterPro domain: [285-316]  IPR013087  2.9e-18  Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL19004  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206047-TA
ATGTTGGCGCAATCCGCAAAGGTCAAAGAAGGAATTCGCGCGGTCCGGCGCTTGCGACACAACAAGGAGACATCTGAGCTGGAGTCACTGCTTTGTAAGCAGGAGGGAGCTTCGCTGGCGCCGTCCCACATGGAGCTATTTGATGGCGGCACGAGCTCGCGAGACGATGCCTTCCTTCGTCCCGCTCTGTGGGAAGACATCGCCTCCTCAATCAGGAATATCGATCCTGAAAACGCGAACATGCTTGCTCCCCTCGGAGCCACACACGTTAAACTCGAGGCTGAGGATGCACTTCTGGAGGAGTGCGTCACTCCTTTGCTCAGTCCCCTGGAGATTAAGACGGAAAGGCTCTGCGCCCCGCCGACGTACGCGCAACCTCAGCCGCCGCAGCATCAACCCCATCAGCAACCAGCTAGCTCATTCTCAAAGTACCCACCGTCGAGGCTGATTTACATGTCTCCGTTGACTCCACCGGGATCCGATCAAGGCAGCCCAGGGAATTCGATGCAGGGCGGGCCACGTAGGACGCCGCCACCGCCATACCACGCGCCGCCACATCCGCCTCAGCCACCACCGTTGCTGCGAGCACCCCATGCTCCGCATCCACCACAACATCCTTTGCCGCAAACTCATCACCATCCGCAACAGGCGATGCAGCAACCACAGCAGCAGTTACAGATGGCGCCACAACTGCCCCCGGTGCCTCATCCTCCACCGCACTCACGACTCACCTCACTATCCGTTAAATACAACAGACGGAACAATCCTGAACTAGAGAAAAGAAGAGTGCATCACTGCGACTTTATCGGTTGCACTAAAGTCTACACCAAAAGTTCTCATCTGAAAGCCCATCAGCGAATACATACAGGAGAGAAACCGTATACTTGTCAATGGCCTGAGTGCGAATGGCGGTTCGCGAGGTCCGACGAGTTGACGCGCCATTATCGCAAACACACCGGCGCTAAGCCGTTCAAGTGTGCGGTCTGTGAGCGCTCTTTCGCCAGGTCCGACCATCTAGCGTTGCACATGAAGCGGCACCTGCCCAAGACGTCCAAATGA

Protein sequence:

>DPOGS206047-PA
MLAQSAKVKEGIRAVRRLRHNKETSELESLLCKQEGASLAPSHMELFDGGTSSRDDAFLRPALWEDIASSIRNIDPENANMLAPLGATHVKLEAEDALLEECVTPLLSPLEIKTERLCAPPTYAQPQPPQHQPHQQPASSFSKYPPSRLIYMSPLTPPGSDQGSPGNSMQGGPRRTPPPPYHAPPHPPQPPPLLRAPHAPHPPQHPLPQTHHHPQQAMQQPQQQLQMAPQLPPVPHPPPHSRLTSLSVKYNRRNNPELEKRRVHHCDFIGCTKVYTKSSHLKAHQRIHTGEKPYTCQWPECEWRFARSDELTRHYRKHTGAKPFKCAVCERSFARSDHLALHMKRHLPKTSK-
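
The transcript and protein above can be cross-checked against each other: 352 residues plus one stop codon give 353 codons, or 1059 bp. Below is a minimal Python sketch of that check. It assumes the standard genetic code, which the record does not state explicitly, and it writes the stop codon as "-" to match the protein sequence shown here. The names `CODON_TABLE` and `translate` are illustrative only and are not part of the geneset.

```python
from itertools import product

# Standard genetic code (assumed; the record does not name a translation table).
# Codons are enumerated in T, C, A, G order, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` ("-" by default, as in the
    protein record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 21 nt and last 15 nt of DPOGS206047-TA, copied from the record.
first_codons = "ATGTTGGCGCAATCCGCAAAG"
last_codons = "AAGACGTCCAAATGA"

print(translate(first_codons))  # start of DPOGS206047-PA
print(translate(last_codons))   # end of DPOGS206047-PA, including the stop
print(3 * (352 + 1))            # expected transcript length in bp
```

Passing the full DPOGS206047-TA sequence to `translate` and comparing the result with DPOGS206047-PA extends the same check to the whole record.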