Monarch geneset OGS2.0

DPOGS204913
Transcript        DPOGS204913-TA   1176 bp
Protein           DPOGS204913-PA   391 aa
Genomic position  DPSCF300340 - 16642-22902
RNAseq coverage   57x (Rank: top 69%)
Annotation
Database        Hit               E-value  Identity  Description
Heliconius      HMEL007179        5e-36    40.00%
Bombyx          BGIBMGA001736-TA  1e-56    35.00%
Drosophila      mld-PD            2e-10    32.26%
EBI UniRef50    UniRef50_D1ZZH0   2e-11    23.26%    Putative uncharacterized protein GLEAN_07456 n=1 Tax=Tribolium castaneum RepID=D1ZZH0_TRICA
NCBI RefSeq     XP_001954254.1    7e-12    25.07%    GF18186 [Drosophila ananassae]
NCBI nr blastp  gi|270005405      9e-11    23.26%    hypothetical protein TcasGA2_TC007456 [Tribolium castaneum]
NCBI nr blastx  gi|348541437      3e-25    40.96%    PREDICTED: zinc finger protein 436-like [Oreochromis niloticus]
Group
Gene Ontology    GO:0005634         6.9e-07  nucleus
Gene Ontology    GO:0008270         6.9e-07  zinc ion binding
KEGG pathway
InterPro domain  [8-75] IPR012934   6.9e-07  Zinc finger, AD-type
Orthology group  MCL27813           Specific divergent

Nucleotide sequence:

>DPOGS204913-TA
ATGGAGGTCACGGATAAAAATTTATGTCGGATATGTTTATATTCTCAAAGTAAATTATACCTCATAACAGATACAGAGTTGCAGGAAATATACGAGAAACTAACTGACATCAAAATACTCACAACAGATGTCCATCAGTATGTCTGTTTTCTATGTTACACTGTATTGAGACAATGTCGAAAACTAAAACAACAGGCCATATGGTCGGAAAAATTGTTAAAGGAGAATGCCAATGCTAGTATTGAGATAAATCATACATTCAGGTTGAGCAAGTCAAATATAATGACCGTTGATATAAAACCGGAATTTGAATCGGGAGTCAGTGATGTAGTTATTAAGAGGGAAGTCGAAGACATTTATACAAGTGATGTGGAAACTCATACAAAAAATGATTCAGAAGCCAACGAGAAGACCGACTCCGAAGATGACATTCCACTGAAGAATATCAGTTCTAAGAACAAGGATGTAGTCGGCAAGAGGAAAGAAAGAATAATCAACGCCAAGGAAATAATTCTGAGCCACGAGGAACAGCTCCAGGAGATGATGGAGCGATCCAAGTCTGAGAACTACACTAACTCGCCCTACAAATGCGATCTGTGCTACAAGGGGTTCATTGACCCAACCGCTTTTGACAAACACAAGGAGAAACACGATAAGATAAGCGGTCCGTACAAGTGTGGTATATGCCACCTCCGCTACCCCAGCACCAAGTCCCTGAGGGCGCACACGGCATCCACACACTCCAGGAGTACATACCTGACGCACATGAGGAAGCTCCACGCGGGGGATCACGTGTGCAGGCAGTGTGGGGACAGCTTCTACAAGAGGCGAGGGCTCATGATGCACGTCAGCAAGTCTCATCGAAGGGAGCAGAATAACACAGAACCTCCCCCCAACTGCGTGTGCAAGGACTGCGATATACAGTTCACTAATATAGATGCTTACACCAGACATCTGTTGATAACGAACAAACACACACTCTTCAACGACGATCCCGAGCTACCCTCGGCTACCACCAGCGCCGTCACACCGGTGAGAAGCCGTTCCCCTGCGACCGTTGCCCCGCGACCTTCTCCAGCCGCGAGTACCTCCGGGTCCACAGTCGAAGCCACTCGGGCGAAAGACCTTACGTGTGCGCGCTCTGTGGACACGCCTTTAGCCAGAAACCTGCACTGA

Protein sequence:

>DPOGS204913-PA
MEVTDKNLCRICLYSQSKLYLITDTELQEIYEKLTDIKILTTDVHQYVCFLCYTVLRQCRKLKQQAIWSEKLLKENANASIEINHTFRLSKSNIMTVDIKPEFESGVSDVVIKREVEDIYTSDVETHTKNDSEANEKTDSEDDIPLKNISSKNKDVVGKRKERIINAKEIILSHEEQLQEMMERSKSENYTNSPYKCDLCYKGFIDPTAFDKHKEKHDKISGPYKCGICHLRYPSTKSLRAHTASTHSRSTYLTHMRKLHAGDHVCRQCGDSFYKRRGLMMHVSKSHRREQNNTEPPPNCVCKDCDIQFTNIDAYTRHLLITNKHTLFNDDPELPSATTSAVTPVRSRSPATVAPRPSPAASTSGSTVEATRAKDLTCARSVDTPLARNLH-