Monarch geneset OGS2.0

DPOGS202368
TranscriptDPOGS202368-TA1182 bp
ProteinDPOGS202368-PA393 aa
Genomic positionDPSCF300104 + 122833-135012
RNAseq coverage59x (Rank: top 68%)
Annotation
HeliconiusHMEL0171683e-7155.68% 
BombyxBGIBMGA013790-TA1e-5451.38% 
DrosophilaCG13442-PA1e-3337.37% 
EBI UniRef50UniRef50_E2B9744e-4150.00%E3 ubiquitin-protein ligase MARCH3 n=3 Tax=Formicidae RepID=E2B974_HARSA
NCBI RefSeqXP_002059439.13e-3339.27%GJ18733 [Drosophila virilis]
NCBI nr blastpgi|3320247001e-4150.54%E3 ubiquitin-protein ligase MARCH3 [Acromyrmex echinatior]
NCBI nr blastxgi|3320247002e-4550.82%E3 ubiquitin-protein ligase MARCH3 [Acromyrmex echinatior]
Group
Gene OntologyGO:00082703e-14zinc ion binding
KEGG pathway 
InterPro domain[202-253] IPR0130831.8e-17Zinc finger, RING/FYVE/PHD-type
[204-250] IPR0110163e-14Zinc finger, RING-CH-type
Orthology groupMCL26127 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202368-TA
ATGGAAGAAAAGGAAAAATCTAATAAGGATGAAGAATCGCCGAAAGAAGAGATAAAAACAAGAGACAAAATCATCAACAACACTCAGGATGCTGGACCATCAAGAGTAGTAAATCTGCCAGGAGCTATACAGCACAGTCCAAGGAACTTTATCACAGGGAGATCCAAATCCGAGCACAGCATTAAACCTTACATCACAAGTGCAGATGTGGCCAACGCTTTGAAAATAACGTCTCTAAGTTTCGGCAACCAAAATCAGACCACAGACAGGGAGCTGAAAATACCAATGCTAGATAAGAATGATGATAAAGAAGACGCAATAACATTCATAGAACTGAAAAAACTTATGGCCTCTTGTTTCAAAATAAGCAAAGACACTAATGAAATAGCAAACGTGAACGTTCACACCGCTATCAGTGGTATAACAAATGACTTAGTCCACAGCATTCCGAATGTGAGCCTGAATAAGACGGATGTAAATGACGTAGTCGTTACCCAAGATTCCAGATCGAATAGAGCGTCTGAAGGTAGGAAGAAGAACTCGGGAAATCTATCAGAGAAATCGGATTTATTGGCTCAAAGAGATTCCTTATCGAGCATTGGATCTAACGTGTGTAGAATTTGTATGACGAGGGGCAAGGAGAGATTGATTTCTCCTTGTAACTGCAAGGGTTCGCTGGCCAATGTTCACCTGTCTTGCCTCCAGCGTTGGCTCAACCAGGTCGGAAGGAATCATTGCGAGCTCTGTGGTTTTAGCTACCCGGCCATCCGCACTCCGAGGTACACCGTACTCCAGGCCCTGAGACTCTGGTTCTGCAACCCTCGTAACAGAAGTCACCTCCAGACCGATTGTCTGATTTTTTGGCTGCTATCCACCGTAACCGCCGGCCTGCTGGCTGTATGCATCGTTGGCACACAGTACTTTATGATTGAGGGTAACAATTTTGGTAAGCTCGACGCGGCTAGAAACGAAGGATTATCACATCGCATAACAGAAACAGCCATTGATTTTTTCATGGGTATTGTTCTCTGTGGTTATACGGTGACTGTATACTTCCTATGGAAGGACCACTACGTTATGTGGAATCGATGGAGACGAGCTAACGTCAACGTCCAGTTATTGTTGAGCCCAGATTCGAACCCTGTACCTTTCGTCCCGAGATCTAGATATAACATTGTCTAA

Protein sequence:

>DPOGS202368-PA
MEEKEKSNKDEESPKEEIKTRDKIINNTQDAGPSRVVNLPGAIQHSPRNFITGRSKSEHSIKPYITSADVANALKITSLSFGNQNQTTDRELKIPMLDKNDDKEDAITFIELKKLMASCFKISKDTNEIANVNVHTAISGITNDLVHSIPNVSLNKTDVNDVVVTQDSRSNRASEGRKKNSGNLSEKSDLLAQRDSLSSIGSNVCRICMTRGKERLISPCNCKGSLANVHLSCLQRWLNQVGRNHCELCGFSYPAIRTPRYTVLQALRLWFCNPRNRSHLQTDCLIFWLLSTVTAGLLAVCIVGTQYFMIEGNNFGKLDAARNEGLSHRITETAIDFFMGIVLCGYTVTVYFLWKDHYVMWNRWRRANVNVQLLLSPDSNPVPFVPRSRYNIV-