Monarch geneset OGS2.0

DPOGS202816
TranscriptDPOGS202816-TA1122 bp
ProteinDPOGS202816-PA373 aa
Genomic positionDPSCF300018 + 421052-422248
RNAseq coverage63x (Rank: top 68%)
Annotation
HeliconiusHMEL0059933e-15972.16% 
BombyxBGIBMGA001684-TA4e-2531.87% 
DrosophilaMeics-PA2e-2531.06% 
EBI UniRef50UniRef50_D6WJ671e-2829.67%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WJ67_TRICA
NCBI RefSeqXP_001815603.12e-2929.67%PREDICTED: similar to Zinc finger protein 26 (Zfp-26) (Protein mKR3) [Tribolium castaneum]
NCBI nr blastpgi|1892378734e-2829.67%PREDICTED: similar to Zinc finger protein 26 (Zfp-26) (Protein mKR3) [Tribolium castaneum]
NCBI nr blastxgi|2700042675e-3830.75%hypothetical protein TcasGA2_TC003594 [Tribolium castaneum]
Group
Gene OntologyGO:00056343e-12nucleus
GO:00082703e-12zinc ion binding
GO:00036764.4e-08nucleic acid binding
KEGG pathway 
InterPro domain[7-80] IPR0129343e-12Zinc finger, AD-type
[308-331] IPR0130874.4e-08Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL23619 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202816-TA
ATGAATTATAAGGATCGATGTAGAACATGCTTAGGCAGTGGCAAAGAAATGCGACATGTTCATAATATTGTGTCGATAACCGGAGAAAATGTTCGCTTATCCGATATTTTAGAGAATTTTTATAACTATAAGATCCCATCGAACGATTTGCTACCCGACCAAATATGCATTAATTGTGTTCATCAACTAACTATAACCTACTCTTTTAAAATACTTATAAAATCTAGTGAAAAAACATTGTCTGATAGTTTAAATCTCACCGATTCAACAAGTGACAATTATGACGCAAGTTTTGATTCAGACGACCTCAAACCCATCAAAGAATCTCATAAATCTCACAGAAACAGAATTATTAATATGGAGAGAAGAGTATCAAAATTAGAGGACTCTCTAATATTCTGTGTAGAATGTAAGGCTGAATTCCATTCTGTTAAATATTTAAATGAACACTGCCGTAATAGTCATCCTATAAAATGCACAATTGGTCGGGATTGTGATTTTTGTCACGAAAAGTTTGAAGATTTTCGGTCATTAGTATTACATAGAAAATTGCATTTAAGGCCATTTGTTTGTGAAAACTGCTGGGAGGGATTTTACAGTGCAATGGAGCTGAATAATCATTCCTGTCAACCAAATTTAGACAAAAAGGACAAAAATACGTCTGAAAAAGTGCTACGGCAGTGTGATCAGTGTGGTAAATCATATCCTCCTGGCTACATCAGGATTCATATGCTAACACACAGTAGCGATCGGCCGTACAGTTGTAAATATTGTCCAAAAAAATTTAAAGTTCCCGGAAGTTTACATTCACATATTCTATGGAATCACAAAAGGACACGAAACCACAAATGCGAGGTCTGCAATGCTACATTCATATCCTCCAGCTCTAGAAGTTCACATATACGTAAAAATCACTTAAAAGAGAAAAAATACGGTTGTGAAAGCTGTGGCAAACGTTTCTTCTCAAAGTCCGAGTTACAGAGACATTCACTAACTCACACCGGCGTCAAGAACTTCCACTGCCACATGTGTGATAAATCATATCAAACGAGGTACGGACTGAACGTCCACTTGAAGTCGCACACACAAATGTCTATGAATGTGTTAAGTTGTAACATGTAG

Protein sequence:

>DPOGS202816-PA
MNYKDRCRTCLGSGKEMRHVHNIVSITGENVRLSDILENFYNYKIPSNDLLPDQICINCVHQLTITYSFKILIKSSEKTLSDSLNLTDSTSDNYDASFDSDDLKPIKESHKSHRNRIINMERRVSKLEDSLIFCVECKAEFHSVKYLNEHCRNSHPIKCTIGRDCDFCHEKFEDFRSLVLHRKLHLRPFVCENCWEGFYSAMELNNHSCQPNLDKKDKNTSEKVLRQCDQCGKSYPPGYIRIHMLTHSSDRPYSCKYCPKKFKVPGSLHSHILWNHKRTRNHKCEVCNATFISSSSRSSHIRKNHLKEKKYGCESCGKRFFSKSELQRHSLTHTGVKNFHCHMCDKSYQTRYGLNVHLKSHTQMSMNVLSCNM-