Monarch geneset OGS2.0

DPOGS210028
TranscriptDPOGS210028-TA1452 bp
ProteinDPOGS210028-PA483 aa
Genomic positionDPSCF300372 + 84818-91736
RNAseq coverage1019x (Rank: top 12%)
Annotation
HeliconiusHMEL0020900.077.61% 
BombyxBGIBMGA010720-TA2e-16759.51% 
DrosophilaCG9215-PA5e-4149.31% 
EBI UniRef50UniRef50_Q16JJ02e-5835.68%Putative uncharacterized protein (Fragment) n=1 Tax=Aedes aegypti RepID=Q16JJ0_AEDAE
NCBI RefSeqXP_001663463.15e-5935.68%hypothetical protein AaeL_AAEL013321 [Aedes aegypti]
NCBI nr blastpgi|1571354859e-5835.68%hypothetical protein AaeL_AAEL013321 [Aedes aegypti]
NCBI nr blastxgi|1571354856e-6134.98%hypothetical protein AaeL_AAEL013321 [Aedes aegypti]
Group
Gene OntologyGO:00056342.3e-15nucleus
GO:00082702.3e-15zinc ion binding
GO:00036761.4e-12nucleic acid binding
GO:00056221.6e-05intracellular
KEGG pathway 
InterPro domain[6-78] IPR0129342.3e-15Zinc finger, AD-type
[326-352] IPR0130871.4e-12Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL20528 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210028-TA
ATGGATTGGCAATTAACGTGCCGTGTATGCTTAGAAACTGGAGATATGGTTTCCCTTTTCGATTGGGATGAAAATAATGAACAACTCGGTGATAAATATACCTACTGTTGCGGCGTGGAGGTAACTAAAAATGATACATTGCCGACACTAATATGCCTGAATTGCGTGGATCGTTTAACCTACGCCTGCCAATTCAAACAACAATGTCTCTCATCGAATGAAACATTAAAAGAGTGTCTCGAAGAATTTTTAAAAACATCGGCTGCAGTTTCAGCAAACAAACCAGACGAAGGTACTGAAACAGAAACTATAACAATCAAGCAAGAAAATGGTATGCTTTTACAATATGAGTTGCCAGTGGAGCATGACCCACAGGTTCAATGGACGAGGATCATTAAGACTCCTAATAGTGCCGTTGTTACTAAAGCACCGGGTCGGAGGAAGAGAGGTCGGCCACGGAAATATCCTAATGAACAGGGGCAGATTGAACTGGATGGTGGTGATGATCCGGACTTTGTGCCAGTCGGTGAGGACCCTGATTCTGATCATGATATCAAAGAGGAGATACCTGTACCAAAGAAACGTGGCCGTCCGCGGAAGTCTATCCAGGAACAGCCTAAGCCGAAGGACGATGACGAGGACGTGCTCCTCAAGGAAACCATCATGGCCTTCTCGGAGCCCATACCTGAACATATACTGAACCCCAAGCCAAAGAAGAAACGACAACAGCCAAAAAGAAATATACATGTTTGTGAAACTTGCGGGGCCTCGTTCACTTCGAATGCATCCTTGCAAGCTCATATACGTCGTCATTTGGGCATAAAACCGTTCGTGTGCAGTGTGTGCGGCTACGCGTGTGTCCTGAACATGGAACTGCGTCGGCACATGATGCGGCACACCGGCGTGAGGCCGTACAAGTGCAGGGTGTGCGACAGAAGATTCGGCGACTTCGGCAGCCGGCAGAAACACGAACGATTACACATGGGTCTTCGTCCATATCAGTGTTCGTTGTGCGGCAAAGCGTTTACATATTCATATGTGCTAGCCAACCACATGCTGACACACACGGGCGAGAAGAAATATTCTTGTACTCCGTGCAACAAGAAGTTCACAAAGGCGCATCACCTGAAGTACCACAATAAGGTTCATCACAAGGAGCTGTACATCCAACAGCAGTTGGAACAGGAGGCGAGGAAGATCAGGCAGCAGTTGAATGTTACCAACCTGTCCGGAGTACTCACAGGACAGTTTGTGGACGGGACACTCCAACTCATACAGACTAACGAAGACGGCGGAGAAGAAACTCAATTACAAGTTGTCGAGGAACACGAGGATAGCGAGGAGAGGGAGACGCACCAGGCCGTGGCCAACATGCAGGGGGTGGTGTTAGAAACTGACTTCGCTATAGAGGAAGACGACGATGAAGACAGTAAAGATAAATACGACAATTGA

Protein sequence:

>DPOGS210028-PA
MDWQLTCRVCLETGDMVSLFDWDENNEQLGDKYTYCCGVEVTKNDTLPTLICLNCVDRLTYACQFKQQCLSSNETLKECLEEFLKTSAAVSANKPDEGTETETITIKQENGMLLQYELPVEHDPQVQWTRIIKTPNSAVVTKAPGRRKRGRPRKYPNEQGQIELDGGDDPDFVPVGEDPDSDHDIKEEIPVPKKRGRPRKSIQEQPKPKDDDEDVLLKETIMAFSEPIPEHILNPKPKKKRQQPKRNIHVCETCGASFTSNASLQAHIRRHLGIKPFVCSVCGYACVLNMELRRHMMRHTGVRPYKCRVCDRRFGDFGSRQKHERLHMGLRPYQCSLCGKAFTYSYVLANHMLTHTGEKKYSCTPCNKKFTKAHHLKYHNKVHHKELYIQQQLEQEARKIRQQLNVTNLSGVLTGQFVDGTLQLIQTNEDGGEETQLQVVEEHEDSEERETHQAVANMQGVVLETDFAIEEDDDEDSKDKYDN-