Monarch geneset OGS2.0

DPOGS204102
TranscriptDPOGS204102-TA1521 bp
ProteinDPOGS204102-PA506 aa
Genomic positionDPSCF300184 - 274907-278795
RNAseq coverage124x (Rank: top 57%)
Annotation
HeliconiusHMEL0129270.063.93% 
BombyxBGIBMGA013591-TA4e-11850.10% 
DrosophilaCG42726-PA4e-3426.54% 
EBI UniRef50UniRef50_D6WCR32e-4934.86%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WCR3_TRICA
NCBI RefSeqXP_002734413.12e-3332.88%PREDICTED: zinc finger protein 111-like [Saccoglossus kowalevskii]
NCBI nr blastpgi|2700024107e-4934.86%hypothetical protein TcasGA2_TC004467 [Tribolium castaneum]
NCBI nr blastxgi|2700024101e-5534.05%hypothetical protein TcasGA2_TC004467 [Tribolium castaneum]
Group
Gene OntologyGO:00036761.5e-11nucleic acid binding
KEGG pathway 
InterPro domain[218-248] IPR0130871.5e-11Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL19619 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204102-TA
ATGAATACCCAAGTTTGTGTAAATTGTTACAATAAAAAGGCTGTTAATTCGACGGACAACCGTTTAGTAACGGAGAGTTGCGGTCATGTTAAGTGTATGGATTGTCTTCTGCAAGAAAAATCAGGTTGTGTCGCCTGTAAGCAGAACGTTAGTGAGCCTGAAACAATCGAGAGTTCAGAACAGAATCCTCCGCTGACGCCGGTTGAAAGCTACGAAGAACCAGCAGCCTGTGAAGATGCTTCTAACGAGGATAACGCAAACATTGTAGACCCCAATTCTAAGAAAAAATTAGAAATCTCTCACATTAGGATAGAAATGGATGGTGATAGAAAGTGTTATACATGTACAGTGTGCAAGAAAAAATTTTATGCTCGAAGCCAGGTGTCATATCATGCTTACTGCAACGGTCAGAAAAAACCATACAATTGTCAACTTTGTAAGCAGAGCTTTGCATCCCTATCACATTATAAATACCACACTCGAGTGCACAGCCAGGAACGTTCTTATGGCTGTGATGTATGCGGAGCAGGCTTCTATCAGATGTCAAAGCTGAAGAGGCACAAGTTGAAACATACCAAAGAGAAGAATTTCTCGTGCAGTGAGTGCAATAAAGCATTCAACAACATGTCCTCGCTGAGAAAACATGCTCGAACACACACCGAGGAACGACCGTACTCGTGTAACACATGCGGCAGGCGATTCAGAGACAGCTCCAACTATAAGAAACATGTCGATAAACACAAGAAACCATGCAAGTCGTGTGGAATGGAGCTTCAGGCGTCTGGTGTGCATCGCTGCGAAGGTCCTGGTAGTGGCAGTCCTGGTGGTAGGGGGAATGGTGGTGGGCCCACCACGGGGGGGCCTCGCGCTCACGCCTGCCCCCGCTGCAGGAAGGCCTTCCACTCCCGGAAGGACATGAGACGCCACGCAGCCATTCACTCAGATTCTAAACCGTTCCGCTGCAAAGCCTGCCCTGACGAACGACGTTTCAGACGCAAAGATAATTTAGAGCGGCACATCCGCAATGCACACCCCAACTGCGCACCCGCCACAGCTCTGGAATGCGATCTGACCGCCCTCCAGAGCGTAGCCCACCACGCCTATGAAATACACGAGAAGATACGCCTAGAAATTCTCAACCCATTACCGCCTCTACCTCAGGAAGTCATCCAGAAACACATAGACGTCGAGGTCGTTGACAAAAAGTCGATCATAGAGGCGAACAACGCCAGGGAGAGCGTCATAGTCGAGAAGAAACCCGAGGAGAAGAAGGTCGTTACTCCAGAGAACGAATATGTCCATAAAATAAGGAAAGCCATAATACCTCTACCGCCCATAGACCAGGAGAAGTTCAGGAGCGTTCAGAGAGGATTGTTACCGGACAGCGTGGCCGCAGCACCCATCAAGAATATGGAGATATACAAGAAGATACTGTACGAGAAGATAGAGAAGGACTCGGCCGAGGTCATACAGAACCCCAAGATGCATTGGAGGAGGAAGATGGAACAGGATAGCAATTAG

Protein sequence:

>DPOGS204102-PA
MNTQVCVNCYNKKAVNSTDNRLVTESCGHVKCMDCLLQEKSGCVACKQNVSEPETIESSEQNPPLTPVESYEEPAACEDASNEDNANIVDPNSKKKLEISHIRIEMDGDRKCYTCTVCKKKFYARSQVSYHAYCNGQKKPYNCQLCKQSFASLSHYKYHTRVHSQERSYGCDVCGAGFYQMSKLKRHKLKHTKEKNFSCSECNKAFNNMSSLRKHARTHTEERPYSCNTCGRRFRDSSNYKKHVDKHKKPCKSCGMELQASGVHRCEGPGSGSPGGRGNGGGPTTGGPRAHACPRCRKAFHSRKDMRRHAAIHSDSKPFRCKACPDERRFRRKDNLERHIRNAHPNCAPATALECDLTALQSVAHHAYEIHEKIRLEILNPLPPLPQEVIQKHIDVEVVDKKSIIEANNARESVIVEKKPEEKKVVTPENEYVHKIRKAIIPLPPIDQEKFRSVQRGLLPDSVAAAPIKNMEIYKKILYEKIEKDSAEVIQNPKMHWRRKMEQDSN-