Monarch geneset OGS2.0

DPOGS201119
Transcript        DPOGS201119-TA   1431 bp
Protein           DPOGS201119-PA   476 aa
Genomic position  DPSCF300137 + 224944-237937
RNAseq coverage   1363x (Rank: top 9%)
Annotation
Heliconius      HMEL017980        0.0     82.68%
Bombyx          BGIBMGA013669-TA  0.0     75.53%
Drosophila      dnr1-PA           6e-49   34.31%
EBI UniRef50    UniRef50_C3XU73   8e-85   38.92%  Putative uncharacterized protein n=3 Tax=Branchiostoma floridae RepID=C3XU73_BRAFL
NCBI RefSeq     XP_975394.1       1e-105  43.74%  PREDICTED: similar to myosin regulatory light chain interacting protein [Tribolium castaneum]
NCBI nr blastp  gi|340712627      2e-107  44.40%  PREDICTED: e3 ubiquitin-protein ligase MYLIP-like [Bombus terrestris]
NCBI nr blastx  gi|340712627      8e-106  44.51%  PREDICTED: e3 ubiquitin-protein ligase MYLIP-like [Bombus terrestris]
Group
Gene Ontology  GO:0005488  2.8e-19  binding
               GO:0005515  8.4e-05  protein binding
KEGG pathway 
InterPro domain  [80-188]   IPR019748  6.5e-22  FERM central domain
                 [4-193]    IPR019749  9e-22    Band 4.1 domain
                 [77-186]   IPR014352  2.8e-19  FERM/acyl-CoA-binding protein, 3-helical bundle
                 [6-79]     IPR018979  1.2e-09  FERM, N-terminal
                 [401-457]  IPR013083  4.3e-06  Zinc finger, RING/FYVE/PHD-type
Orthology group  MCL16022  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201119-TA
ATGTGGTTAGTCAGTCAGCCGAATTCAGTGATTCTGGAGATCAAGGTGGAACCCAATTCCATCGGACAACAGTGTTTGGAAAAAGTTTGCGAGAAGTTGGAGATAGGAGCAGAGGCCGACTACTTCGGACTCCGAGTGTGCTCGGGCTCCGGGCCCGGCAGGTGGCTGAACCTCAGGAACCACCTGGACCCGCACCGCATACCCAGCAGGCGCCTAGACCTGCGGGTCAAGTTCTGGGTCCCGCCGCATCTCCTCATCAACGAGCCCACACGACACCAGTTCTACCTACACGCCAAGCTGGACCTCATCGAGGGGCGGCTGGTGGTGGCCGACCAGGAGGTCGCCAGGAAGATAATAGCCTACATCGCCCAGGCCGAGACCGGAGACTTCGACCCCCAGGCCGCCTCGAACGTGTACGCCGACTGCGACAAGATAGGCCCCCAGGGAGAGAAGCCCGACGACCACGAGGCCAAGATCATGGAGTATCACTGCCAGATAGCGGGCATGAGAGCCTCGCAGGCCGAATATAAACTGTTGAAGGAGATATCTAAACTGGAGTCCTTCGGGGAGGAGATATTCTTCTGCAAGCCGGTGACGCAGAACAACAACGCCCACAACCTGTACAGTCACCTGCTGTACCACAGGCAGCAGGCGGAGGAGCCGCGGACCAGGGACGACCAGGAGATAGACGGGGGAGGAACCGTGGGCTGCCTCACCTCCGCACACACCGGGGCGGGGTGCGGCTGTAGACTTAGCACCTGCGTGGGAGTCGGGCCTAGTGGGATCGTGGTGTACAGGCCTGCCTGTAACGGAGTGCACGACATAGGAGTAGAGAAACAGAGTATTCCCTACACATCCATCCACCGCGCCCAGCCTGTGCGCCGAATCTTCCAGCTGTGCTACGTGTCTGATGAGGGTCACGAGGTCACCCTCCACGTGAAGATGGCCACCTCCAGCCAGGCCGCTGCCCTGTACCGAGCAGTCACTGAGAAACATGCCTTCTACTACTGTGAAACTGTACGAACAGACGTCACGGAACAGTTCATTAGAGATCTGAAGGGGACGATCGCGTCCATCTTCAACGACTCGTCGACCCTCGGGCGCCGCTACGTGTTCGACATCCGGCGCACGTGCCGCGAGGTGCACGACCGAGCGCGCCGCGAGCACTACGCCCGGACCCGCGACAGGACCCAAACCGCTCAGGAGTCCCGCCCTCGCTCCCGCTCCCGCTCGGTGGAGGCGCTGGCGTGCCGCGTGTGTATGGACGCGCCCATAGACACGCTGTTCCTGCCCTGCCGACACGTGCTCTGCTGCGAGCACTGCGCGCCGCGGTGTGAGCGCTGCCCGCTGTGTCGCGGGGAGGTGGACAGGCTGATGCACGTCTTCCTGCCGCTGGAGTACCAGCGGAGTCCCGGGGTTATTATAAAATAG

Protein sequence:

>DPOGS201119-PA
MWLVSQPNSVILEIKVEPNSIGQQCLEKVCEKLEIGAEADYFGLRVCSGSGPGRWLNLRNHLDPHRIPSRRLDLRVKFWVPPHLLINEPTRHQFYLHAKLDLIEGRLVVADQEVARKIIAYIAQAETGDFDPQAASNVYADCDKIGPQGEKPDDHEAKIMEYHCQIAGMRASQAEYKLLKEISKLESFGEEIFFCKPVTQNNNAHNLYSHLLYHRQQAEEPRTRDDQEIDGGGTVGCLTSAHTGAGCGCRLSTCVGVGPSGIVVYRPACNGVHDIGVEKQSIPYTSIHRAQPVRRIFQLCYVSDEGHEVTLHVKMATSSQAAALYRAVTEKHAFYYCETVRTDVTEQFIRDLKGTIASIFNDSSTLGRRYVFDIRRTCREVHDRARREHYARTRDRTQTAQESRPRSRSRSVEALACRVCMDAPIDTLFLPCRHVLCCEHCAPRCERCPLCRGEVDRLMHVFLPLEYQRSPGVIIK-
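
Consistency check (illustrative sketch):

The lengths in this record can be cross-checked against the sequences: a 1431 bp transcript that is entirely coding and ends in a stop codon would encode 1431 / 3 - 1 = 476 residues, matching the listed protein length. The Python sketch below shows that arithmetic and translates the first and last few codons of DPOGS201119-TA. It assumes the standard genetic code and writes the stop codon as "-", as in the protein sequence above. The function names `translate` and `coding_residues` are hypothetical helpers, not part of any tool used to build the gene set.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def coding_residues(cds_length_bp):
    """Residue count for a CDS of this length that ends in one stop codon."""
    return cds_length_bp // 3 - 1


# First 12 and last 18 nucleotides of DPOGS201119-TA, copied from the record.
first_codons = "ATGTGGTTAGTC"
last_codons = "GGGGTTATTATAAAATAG"

print(translate(first_codons))   # start of the protein
print(translate(last_codons))    # end of the protein, including the stop
print(coding_residues(1431))     # residues implied by the transcript length
```

Passing the full nucleotide sequence to `translate` in the same way should reproduce the DPOGS201119-PA sequence if the assumptions above hold.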