Monarch geneset OGS2.0

DPOGS208522
TranscriptDPOGS208522-TA2601 bp
ProteinDPOGS208522-PA866 aa
Genomic positionDPSCF300064 + 148174-159560
RNAseq coverage51x (Rank: top 70%)
Annotation
HeliconiusHMEL0036540.092.85% 
BombyxBGIBMGA008374-TA1e-16093.23% 
DrosophilaCG9650-PH7e-11751.89% 
EBI UniRef50UniRef50_D2A1A87e-16557.20%Putative uncharacterized protein GLEAN_07120 n=2 Tax=Tribolium castaneum RepID=D2A1A8_TRICA
NCBI RefSeqXP_975280.10.055.67%PREDICTED: similar to B-cell CLL/lymphoma 11A [Tribolium castaneum]
NCBI nr blastpgi|910815830.055.67%PREDICTED: similar to B-cell CLL/lymphoma 11A [Tribolium castaneum]
NCBI nr blastxgi|910815830.057.83%PREDICTED: similar to B-cell CLL/lymphoma 11A [Tribolium castaneum]
Group
Gene OntologyGO:00036763e-12nucleic acid binding
KEGG pathway 
InterPro domain[516-546] IPR0130873e-12Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL15740 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208522-TA
ATGACCAGTGTCCGGTTAGAGGAGGCGTGGCTACCTGCGGCTGCGGATGCGGACAGCGCGGCAGAAGGCGGAGGTTCGTCGTGTTTGCCAGCGGATACGCTGACATGCGGGGCCTGCAGGAAGGCTTTCGCGCTAGCGGACATAGTGCGATTCATTCAGCACAAAGTCTCCTCTTGCGATAAAGACCTCACTTCATACCACTGCTATAGTGGAGGTCCAAACTCAGACCAAGAAGATGGAACCCGACCAGGACAAATTGCTTCTGGCAACACGAGTGGGCGAAGACCCTCACTCCTTACAGCAAGGCGACCTCCCAACAGTAGAGTGCATACTCCTCCTCTCGCAAGCCCTTCGATAGCTCCACCAGATTTACTTGAGGATGGCGGGGCATCAAGCACTCCTAAGCGGCTGTTGGATGAAGCTGACGCAGGAGTATCGACTCCGAAACGAAGAGCATCTACTTCACCGATGCCTTCGAGTTCACCAGATGAAGATATCAAACCTAAAATTAAACAAGAGCACATGGACAGAACCAGCTCACCTGAAGATCATAAGAAGTCGAGAACCGAAATGGCTGACGCTGAATCAAATACCATGCATAGTGAACCGAGCAACTACGTCTGTTCTACCTGCAAAGCGCGAGTACACTCTGCTTGGAGACTGGTTCAGCACGTTCAGCACGTGCATGGCGTAAAAATTTACGTTGAGAGCATGCCTCAGCAATTACCGAATAAACAGAACCATTCATCATCGAGCACGTCATCATCAAGCTCTGGTTGTTCTTCTACTGGTGCTCCATTACCGCCACCATCGCTTCGACATCATCCTTTATTGCCACCACCAGACATGCACTCTCCTTTTGGAGTGGGAGGCTTATTGAGGATGCCTCTGCCGGGTAGCTTACCCCCACTAGCTCATCCTTCAGTACCACCGACACCTTTGTTTGCAAGACCAAATCATCACGATCATAGATTTAGAATGGAACAACTGGTGTCTGAGCAATTTCGCCATCACGGACTCAACTTGGCAGCAGCGGCCGCCGCTGTGGCAGCAAATTCTCTGCCACCACACCAGACATTTCCGTCACCAGCCGATAGACCACCTATCGTTCCAACATCTTTGACAGGACGTGATAGACCACCAGTGTCACAACCACTTTCTCTAGAACCCCAGCTGGATTTTTACTCGCAGCGCTTGCGGCAACTGGCCGGTACGACCAGTCCAGGGGCGGCCACAGGCAATTCGAGCTCTCCTAGCCCCAGGAAGCATTCACCGCCGTTCGCTTCCCCGTCGCCCTCCCGAGTCGGCCAGACTCCTCCGGTGGGCGCCGGACCCGGAACGGTGGACACTCCTCGAGAAGCCATAAGAGCTCAGAGTTCCATATCCCCAGAAAGACGAAATGAACCTCAAACAGCTGACGCACCACCTTCGGATAGGCCGTCTTCAACACCACCTACAAAGAGAAACAATGATGAAGCCATTCATACATGTGAATTCTGTGGGAAGAAGTTCCGTTTTGAAAACAGTTTAATAGCTCACCGACGTATTCATACTGGAGAAAAACCCTTTAAATGTAATCAGTGCGATGAAACATTTGAAAAAAATTCAAAATTAAAAAAGCACATGAAAGCTCATCGAGCAGCTGAAGGTAATACTGAAGACTTAGAATCCGGGGGAGACACTGGTGAAGATGATTCTGAAGATGACTTAGATGATGAGGAATTAGAGGGTGAAGAGGAGGAAGAAAATGAAGATGGAGAAGAGGTTGAAGAAGCTGAAGATCTTACCGTATCTAATCATAGTGCTCCTTCAGCGCCCCCACGTAAGCAAACTCCAGCGATACCTGCACACCCACCTACCGCATCCGTGGTCGGTGAACTCATGGACAAATTTGGATTATCTAATATTGCACAATATAGTGAAGCTTTTAAACAAGCTTTGCAAGAATCTGGCAACTCTTTAAAATGGCAATTAGCTAAAGACCGTGACAACAACAATGGTCCTCCTTCTGAAAAACCTAACGGTATGCCACCAACAGCGGCTCTTCGTTTAAAAGAGGAGTTTGCCAAAATGCCACCTCAGCCACATCCATTATTTAACCCTTTTGAGAACCCTTTTGAAGCCTCTAAGAGGATGAAACTGGATATGGACAGGGGTGAAGGCTGGTGGCTCCCAACATTACATGCTCAGCGGCCACCAGATAATATTTTTGATGGTCTAAAGAATAGTAGTAATGGGTTACTACAGAATCCATTATTAAAATCTAAAGATGGCCGCCGTAATGACACGTGTGAATTCTGCGGAAAAGTTTTCAAAAACTGCTCTAATTTAACTGTTCATCGGAGATCGCATACCGGAGAAAAGCCATATAAATGTGAACTGTGTTCCTACGCATGTGCACAGAGTTCCAAACTGACAAGGCATATGAAAACTCACGGCCGGCTCGGGAAAGACGTGTACCGCTGCCGGTTCTGTGAGATGCCATTCTCCGTGCCATCTACCCTCGAGAAGCATATGCGAAAGTGTGTTGTGAATCAAAGTAATGGTGCTTCTCTTGCTTTATCCGATGATTCGAACGCATGTCGTGATGAGGCTTCGTGA

Protein sequence:

>DPOGS208522-PA
MTSVRLEEAWLPAAADADSAAEGGGSSCLPADTLTCGACRKAFALADIVRFIQHKVSSCDKDLTSYHCYSGGPNSDQEDGTRPGQIASGNTSGRRPSLLTARRPPNSRVHTPPLASPSIAPPDLLEDGGASSTPKRLLDEADAGVSTPKRRASTSPMPSSSPDEDIKPKIKQEHMDRTSSPEDHKKSRTEMADAESNTMHSEPSNYVCSTCKARVHSAWRLVQHVQHVHGVKIYVESMPQQLPNKQNHSSSSTSSSSSGCSSTGAPLPPPSLRHHPLLPPPDMHSPFGVGGLLRMPLPGSLPPLAHPSVPPTPLFARPNHHDHRFRMEQLVSEQFRHHGLNLAAAAAAVAANSLPPHQTFPSPADRPPIVPTSLTGRDRPPVSQPLSLEPQLDFYSQRLRQLAGTTSPGAATGNSSSPSPRKHSPPFASPSPSRVGQTPPVGAGPGTVDTPREAIRAQSSISPERRNEPQTADAPPSDRPSSTPPTKRNNDEAIHTCEFCGKKFRFENSLIAHRRIHTGEKPFKCNQCDETFEKNSKLKKHMKAHRAAEGNTEDLESGGDTGEDDSEDDLDDEELEGEEEEENEDGEEVEEAEDLTVSNHSAPSAPPRKQTPAIPAHPPTASVVGELMDKFGLSNIAQYSEAFKQALQESGNSLKWQLAKDRDNNNGPPSEKPNGMPPTAALRLKEEFAKMPPQPHPLFNPFENPFEASKRMKLDMDRGEGWWLPTLHAQRPPDNIFDGLKNSSNGLLQNPLLKSKDGRRNDTCEFCGKVFKNCSNLTVHRRSHTGEKPYKCELCSYACAQSSKLTRHMKTHGRLGKDVYRCRFCEMPFSVPSTLEKHMRKCVVNQSNGASLALSDDSNACRDEAS-