Monarch geneset OGS2.0

DPOGS212719
TranscriptDPOGS212719-TA1194 bp
ProteinDPOGS212719-PA397 aa
Genomic positionDPSCF300012 - 421441-424926
RNAseq coverage195x (Rank: top 48%)
Annotation
HeliconiusHMEL0083126e-16180.18% 
BombyxBGIBMGA013125-TA1e-13872.62% 
DrosophilaCG10321-PA3e-3444.19% 
EBI UniRef50UniRef50_Q17Q021e-4655.28%Putative uncharacterized protein n=3 Tax=Culicinae RepID=Q17Q02_AEDAE
NCBI RefSeqXP_001658901.12e-4755.28%hypothetical protein AaeL_AAEL000170 [Aedes aegypti]
NCBI nr blastpgi|1571177144e-4655.28%hypothetical protein AaeL_AAEL000170 [Aedes aegypti]
NCBI nr blastxgi|1571177142e-4846.83%hypothetical protein AaeL_AAEL000170 [Aedes aegypti]
Group
Gene OntologyGO:00056349.7e-10nucleus
GO:00082709.7e-10zinc ion binding
GO:00036765.8e-07nucleic acid binding
KEGG pathway 
InterPro domain[15-83] IPR0129349.7e-10Zinc finger, AD-type
[297-323] IPR0130875.8e-07Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL19611 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212719-TA
ATGGCTGCCGCTGTTAAAACGCAAAAGTACAACTTTGAGAAGATATGTCGCGCTTGTTTGCAGATAAAGAAAGATATGAGACCATTGTTCGAACAGCTCACAGCTACCATGTTGATGGGCATATCAAAAGTGCAGGTTTCTGTGGGCGACGGTCTTCCGTCCCAGTTGTGTCTGCAATGCGTGCATCAGATCTCAAGGTGTCACGCCTTCAAGGATCTGGTGGAGAGGAACGACGTGGTCCTCCGCGAACAGGCGAAAGCAATGGCTGAAGAGACGCTCAAGAAGGATGAGGTGCTCGCGGAAACGAACCAGTACATCCAGTTTATAGAAGTGCCGAGCCTGAACAACACCGCCAGCCAATTACTGGACAGCTTCTTCCCCGAGACCTCCAACGACACCAACGTCACGCAGGAGATATTGAATAAGGACGACTTCGACGTTGACAAGCTGGAACCTTCCAGTGCAGTGGCCAAGGCGGAGGAGGCGTTGAATTCAGACGAGGAGAACTATCTCCAGATGGTCGTATTCCAAGCCACTTCATCAGTGGCCCCGTTCAGACACGTCTGCAATCTCTGTCAGAAAGAGTTCAAATACGCCAAGTGGCTGAAGATGCACATGCTGTCTCACTCTAACTGGATCAAAGCGAACTGCAAGAAGCCGCCGATGTGCCACATATGCGAGAGAACGTTCAAGGGTCCAGGGATGTTGAAGATGCACATGAGGACCCACGAACAGCGACCTCCTAAACAGCCGACGTGTTCCGTCTGCCAGAGGACCTTCCCCACCAAGACCTTACTGTATAGACACAGGCAGACACACTTCGAACAGAAGACACACCAGTGCACGGTTTGCGAGAAGCGGTTCTTCAGCGGGTACGCCCTGAGGTCACATATGGCAAGGCACAGAGGAGAACGGCCGTACATCTGCTCTATATGCCTCAAGAGCTTCTACAACCCCACCGATCTGAAGGTATCATGTGACGTGTCCATCCTATGTCGTGATGGACACTTAGCCTTTCAGCGTAAAATAAAAACTATAGTACTCATAATGTTTATTAAAACTCAGAACAGACCAAAGCTATTTGTAAATGGACGGCGTGGAGAGAGTTCCGCACTTGAGCTTGTCGAATTTCTTGTCTTCATCCTCCGTAGTCCTGAAGGCTTCTTCGAAAGCTTCTTTGCTGCGTTTGGATAG

Protein sequence:

>DPOGS212719-PA
MAAAVKTQKYNFEKICRACLQIKKDMRPLFEQLTATMLMGISKVQVSVGDGLPSQLCLQCVHQISRCHAFKDLVERNDVVLREQAKAMAEETLKKDEVLAETNQYIQFIEVPSLNNTASQLLDSFFPETSNDTNVTQEILNKDDFDVDKLEPSSAVAKAEEALNSDEENYLQMVVFQATSSVAPFRHVCNLCQKEFKYAKWLKMHMLSHSNWIKANCKKPPMCHICERTFKGPGMLKMHMRTHEQRPPKQPTCSVCQRTFPTKTLLYRHRQTHFEQKTHQCTVCEKRFFSGYALRSHMARHRGERPYICSICLKSFYNPTDLKVSCDVSILCRDGHLAFQRKIKTIVLIMFIKTQNRPKLFVNGRRGESSALELVEFLVFILRSPEGFFESFFAAFG-