Monarch geneset OGS2.0

DPOGS214779
TranscriptDPOGS214779-TA1464 bp
ProteinDPOGS214779-PA487 aa
Genomic positionDPSCF300022 + 1598527-1602696
RNAseq coverage13x (Rank: top 83%)
Annotation
Heliconius% 
BombyxBGIBMGA009435-TA2e-5050.57% 
Drosophila% 
EBI UniRef50UniRef50_Q243103e-4531.01%Polyprotein n=12 Tax=Drosophila RepID=Q24310_DROME
NCBI RefSeqXP_002035219.18e-1228.25%GM14584 [Drosophila sechellia]
NCBI nr blastpgi|10307319e-4531.01%polyprotein [Drosophila melanogaster]
NCBI nr blastxgi|10307318e-4529.74%polyprotein [Drosophila melanogaster]
Group
Gene OntologyGO:00082703.2e-06zinc ion binding
GO:00036763.2e-06nucleic acid binding
KEGG pathway 
InterPro domain[267-375] IPR0211096.9e-08Peptidase aspartic
[196-259] IPR0130843.2e-06Zinc finger, CCHC retroviral-type
Orthology groupMCL23329 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214779-TA
ATGTCACAACCATCTACCGCTGTCAATACTGTTAAACTAATTGAATTTGATCCTAACGATCCAGAGGCTGACGTTGAAGGTTGGTGTAATGTCACCGAAATCATAGTGCAAGAAAAGCGTCTTGAGGGTGTGGAATTGCTTATAGCACTTACCAAAGCGCTTAAGGGTCACGCTGCGACTTTCTTAACCAAAATAAACCTCCAAGAGCTAAAATGGAGTACTGTTAAAGATACTTTGTTAGCAAGATTCGCTAAGCCTAAATTCATACAAGACCATTTCGATGATATTTTAAAATTTCAAATTGGTTCTAAAGAAACCGCGCCAGAGTCAGCACTGCGTTTATGGAACTTGATCGAGCGCATACCAAAAGTTGAATTCCCTGAGGAAGTTATTACAGGATTCGTAATATCGGTGTTATGTCAAAAAGACTATGTTATACGAAGAGAATTGACTACACATGTGATTGCTACTCGCGCTCAATTATTTCGGGTTCTAGGTGGTATATCACTGAAGCGACGGTCAGATGTTAATAATGGAAATCAAGAGCCTGACGTAAAGCGGTCACGGATTGACTCGAAATTTCCTGGAAAATGCCATTGGTGTGGAGTGTCTGGTCATAGACAAGCTGACTGTAGAAAGCGAAAGGAAGATATCAACACTGCGAAGATCCAGGATCAATCAAGCTCTACTTCTACACGTGGTCAAGACAATCTAACTGTTTGCTGTTACACGTGTGGGAAGCGTGGTCATGTTTCTACTGCCTGCCCAGAAAAGAAGATGAAGGACGGGATTGAACGAAGGGAAGTCAACCTATGTGGACATCGGCTTTCGAGATCTACCTTAGAGACATCCACTGGTGAGAGATTTCCATTTTTATTTGATAGTGGATCGTCCTGTTCCCTTTTGACTGAGAGTATCCGCGACCGGTTCCCGGGTATTGTACGCAATAATACTGTGTATCTTACTGGTATAGGAGGTGATGAAGTACAATGCACTTCACAAATATTAAGTACAGTAAAAATCAACGATATCTCGGTTGATCTAGTATTCCACGTCATACCTGATTCAGTAATTTCTGTGCCAGTTATAGTAGGTAGAGATATTTTAAATGAGGGTTTTTGCGTTACGATAGATGACGACAAACTTATATTCAGGACCAAGGAGCGCGCGAACTTTTGCGAAAAAAAGAAGGTTGGTCTATTGAATGATAACGAGGCCGAGACGGCGGCTGAAGATGCACAGCATTCACAAATCAATAGTTATGTTGAGTCACCGCCAAATCAAAATGATAGAAGCGATAACTTAGATATTAGAGATGAAATAGTGGGTGCAGACTCCGACACTGCCTCGGCTAATTCTGCAACCCTAACTGCTGGTTCGGATACGCTTAGTGTGTACTCCGATCCCGAAACGTTAGATGTACAATGTGAAGTCGAAACTTATAGTCAACCAGATGGTTCATAA

Protein sequence:

>DPOGS214779-PA
MSQPSTAVNTVKLIEFDPNDPEADVEGWCNVTEIIVQEKRLEGVELLIALTKALKGHAATFLTKINLQELKWSTVKDTLLARFAKPKFIQDHFDDILKFQIGSKETAPESALRLWNLIERIPKVEFPEEVITGFVISVLCQKDYVIRRELTTHVIATRAQLFRVLGGISLKRRSDVNNGNQEPDVKRSRIDSKFPGKCHWCGVSGHRQADCRKRKEDINTAKIQDQSSSTSTRGQDNLTVCCYTCGKRGHVSTACPEKKMKDGIERREVNLCGHRLSRSTLETSTGERFPFLFDSGSSCSLLTESIRDRFPGIVRNNTVYLTGIGGDEVQCTSQILSTVKINDISVDLVFHVIPDSVISVPVIVGRDILNEGFCVTIDDDKLIFRTKERANFCEKKKVGLLNDNEAETAAEDAQHSQINSYVESPPNQNDRSDNLDIRDEIVGADSDTASANSATLTAGSDTLSVYSDPETLDVQCEVETYSQPDGS-