Monarch geneset OGS2.0

DPOGS205371
TranscriptDPOGS205371-TA1149 bp
ProteinDPOGS205371-PA382 aa
Genomic positionDPSCF300373 - 95684-98152
RNAseq coverage39x (Rank: top 73%)
Annotation
HeliconiusHMEL0134534e-4130.45% 
BombyxBGIBMGA008769-TA6e-11760.61% 
DrosophilaCG15269-PA7e-1732.31% 
EBI UniRef50UniRef50_D6WZ242e-2428.76%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WZ24_TRICA
NCBI RefSeqXP_312793.45e-2331.08%AGAP003111-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|2700125968e-2428.76%hypothetical protein TcasGA2_TC006757 [Tribolium castaneum]
NCBI nr blastxgi|2700125963e-3229.29%hypothetical protein TcasGA2_TC006757 [Tribolium castaneum]
Group
Gene OntologyGO:00036762.6e-08nucleic acid binding
KEGG pathway 
InterPro domain[348-379] IPR0130872.6e-08Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL30612 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205371-TA
ATGGGAGCGAGCTTTAAATGCTTCTTCTGTGCGAAATACTTCTCGGATCCCGGAGAACTGATGCTCCATACACAATCCCACAAGAAAGATATAAGAGACAGAAAGATTATTTTAGATAAATATATACCGAAAGATTCCGACACTGCCGGTGATAAGGATTGGGCGAAAATAGCAGAAAGACGCGGTCGAGGACCCCTATTGAGAGACAATTCCATTAAAATTCTCCAAAACTCAACAATGTTTATCTTCCAATGGCACAAAAGTAGATTTACCTGTTTCCTTTGCAAGGCACCGTTCATAGATATACAATCATTGAGAGATCACACTAAGGCAGAACACGATGGCGTTGAGAAATGTAAAATAGAAAAGAAGATAATAGCCCAACAAAATAAACTGCTCAAAGTGGAAATATCTATTCTGAAGTGTAAACAGTGCGATGGGGACTTCAAGACGCTTTCCGATTTCCGTCTTCATCTGGAGGAACAGCACGAGGTGGAATTCAAAGAGGGCGGCGATTTGCTGGTGCCGTTCAAGTTGGAGAGCGAAGGTCTCAAATGTCAAACGTGCAGAGAAAGTTTTACATTGTTCAGACTCCTCAGCATCCATGTGAACAAGCATTACCAGAACCACGTGTGTCACGTCTGTGGAGCTGGTTTCACAAGTTTGGTGCTGCTCAACCTCCACAGAACGAGATCGCACAGGCTGATAAAATGCAACGAGTGCAATTTGATATTTCATAATCGTAAGGGCAAAAAAATACACGACATAGACCGTCACAACGTGAGGTTCGAACGCAAGCTTCGTTTCCTGTGCCCGTACTGCGATGAGAGGTTCTTTCAAGAGAATCTCAAAATACAACACCTGGTCGACAAGCATGGCGTTGAAAAACCGCAGCACAGATGCAATTTATGCGACAAAGTCTTTATAACCAGAAGCCTCTGTAACAATCACATCAAAAATGTCCACAAAAAAGAAAAAAAACACGTATGCGATGTATGCAGTAAGCTATTTTACACGAAATCCGACGTGCAAAGACACAGAGTGACGCACACGGGCGAGAAAAAGTTCACCTGCCTGGTCTGCAATAACTTGTTCGCAACGAGAGACTCACTGAAGAGACACACGAAAAGGACCCATGTTGAGAATTAA

Protein sequence:

>DPOGS205371-PA
MGASFKCFFCAKYFSDPGELMLHTQSHKKDIRDRKIILDKYIPKDSDTAGDKDWAKIAERRGRGPLLRDNSIKILQNSTMFIFQWHKSRFTCFLCKAPFIDIQSLRDHTKAEHDGVEKCKIEKKIIAQQNKLLKVEISILKCKQCDGDFKTLSDFRLHLEEQHEVEFKEGGDLLVPFKLESEGLKCQTCRESFTLFRLLSIHVNKHYQNHVCHVCGAGFTSLVLLNLHRTRSHRLIKCNECNLIFHNRKGKKIHDIDRHNVRFERKLRFLCPYCDERFFQENLKIQHLVDKHGVEKPQHRCNLCDKVFITRSLCNNHIKNVHKKEKKHVCDVCSKLFYTKSDVQRHRVTHTGEKKFTCLVCNNLFATRDSLKRHTKRTHVEN-