Monarch geneset OGS2.0

DPOGS206645
TranscriptDPOGS206645-TA1605 bp
ProteinDPOGS206645-PA534 aa
Genomic positionDPSCF300048 - 363623-365227
RNAseq coverage152x (Rank: top 53%)
Annotation
HeliconiusHMEL0111380.083.93% 
BombyxBGIBMGA008342-TA0.078.88% 
DrosophilaAPP-BP1-PA1e-13444.98% 
EBI UniRef50UniRef50_E0W0X02e-16951.95%NEDD8-activating enzyme E1 regulatory subunit, putative n=11 Tax=Pancrustacea RepID=E0W0X0_PEDHC
NCBI RefSeqXP_002432014.14e-17051.95%NEDD8-activating enzyme E1 regulatory subunit, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3407291587e-17153.25%PREDICTED: NEDD8-activating enzyme E1 regulatory subunit-like [Bombus terrestris]
NCBI nr blastxgi|3407291582e-16653.25%PREDICTED: NEDD8-activating enzyme E1 regulatory subunit-like [Bombus terrestris]
Group
Gene OntologyGO:00054882.2e-79binding
KEGG pathwayphu:Phum_PHUM5647201e-169 
 K04532 (APPBP1)maps-> Alzheimer's disease
InterPro domain[11-534] IPR0090365.6e-122Molybdenum cofactor biosynthesis, MoeB
[471-532] IPR0160402.2e-79NAD(P)-binding domain
Orthology groupMCL11747 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206645-TA
ATGGCCTCTCCCTCACCTAAATCGCCGGAACAAAGTGAAAAGAACAAACAGTACGACCGGCAACTGAGGCTTTGGGGAGATCACGGTCAAAAAGCTCTTGAAAAGGGTCATATTTGTCTCATAAATGCTACAGCACTTGGTACTGAAATATTGAAGTCAATAGTTCTTCCTGGAGTTGGAGCAATTACTCTTGTTGATCATAATATCGTTAGTGATGAAGATATTGGATGTAACTTTTTCTTAGAAAATATAAGCCAAGGGCTTAATAGAGGAGCAGAAACGCTACGACTATTATTAGAATTAAATTCGTCTGTTCAAGGTCATGCAGTACAAGAGCCTCCTGAACAGATTCTGCAAGACAACCCAGATTTTTTTAAATCTTTTAGTGTAGTTATTGCAACATCTCTAGGTGAGAAAACAACACAGGATTTAGCAAATCATTTATGGGACATTAAGGTTCCTTTTATATTATGCAGATCTGTAGGATTTTTGGGTTCTTTTAGAATACAGATTAATGAGCACCCAATCATAGAAGTTCATCCTGAAAATGAACAGATTGATTTACGTTTAGATGTCCCATTTCCTACATTGGCTGAATATTTGAATGCATTTAATATAGATGATCTTGATTTAAAGGATCATGGACATGTTCCTTGGATTGTTATTCTATATAAAGCAATCCAGAAATGGCAAGTTACAAATGAAAACAAGTGGCCGATCAGTAGGAAGGAGAAAAATGAAATTAAAGATATTATCAGAGGTTTTATTAGGAAGGATGAGAATAGTATACCTATTGAAGAGGAAAACTTTGAAGAAGCCTTAAGAGCCGTCAATACTGCATTGGTCCCAACATTTCTGCCCGTAAATATTAAGGAACTAATATACAGTAGTTCAGCAACAAACTTAACAAAAGACAGCTCATCATTTTGGATCATGTGTTCTGCTTTGAGAGCATTCATTGAAGCAGAAGGAAAAGGAAAGTTACCCCTCCGAGGTGTATTACCAGATATGACGGCTTCTACAGAACATTATGTGAAACTTCAAAGTATGTACAGAACTCAAGCTGCTATTGAAGCCGAGATAGTTTACCGTAAAGTTCAAGAAATAGTGGCACAATTACATTGTGATAGTATAAGTGATGCTGAAGTAAAACTATTTTGTAGACATGCGTATGATCTGCATTTGATAAGAGGCACTAATATATCATCTGAGTATCAAATGGGAACTGTGGCTTCATACATAGCAGGTTATTTAGAGGAACCAGATGTCATGATGGTACATTATATACTATTACGAGCAGTGGACATGTTCAGATCAGAGCATTGCAGAGCACCAGGTGAATGGGAACCAGAAGCCGATATATCTAAATTAAAAACCTGTGTCTCCAAACTATTAAGTGATATATCATGTTCACCATTCCCTAAAGACGATCATATTCATGAAATGTGTAGATATGGTGGAGCTGAAATTCACAGTGTCTCAGCCTTCCTAGGGGGTTGCATAGCCCATGAAGCAATCAAAATTGTAACAAAGCAGTATAAACCTGTTAATAATACATTTATTTATGATGGAGCCTCTACAAACTCAGCCACATTTACATTTTAG

Protein sequence:

>DPOGS206645-PA
MASPSPKSPEQSEKNKQYDRQLRLWGDHGQKALEKGHICLINATALGTEILKSIVLPGVGAITLVDHNIVSDEDIGCNFFLENISQGLNRGAETLRLLLELNSSVQGHAVQEPPEQILQDNPDFFKSFSVVIATSLGEKTTQDLANHLWDIKVPFILCRSVGFLGSFRIQINEHPIIEVHPENEQIDLRLDVPFPTLAEYLNAFNIDDLDLKDHGHVPWIVILYKAIQKWQVTNENKWPISRKEKNEIKDIIRGFIRKDENSIPIEEENFEEALRAVNTALVPTFLPVNIKELIYSSSATNLTKDSSSFWIMCSALRAFIEAEGKGKLPLRGVLPDMTASTEHYVKLQSMYRTQAAIEAEIVYRKVQEIVAQLHCDSISDAEVKLFCRHAYDLHLIRGTNISSEYQMGTVASYIAGYLEEPDVMMVHYILLRAVDMFRSEHCRAPGEWEPEADISKLKTCVSKLLSDISCSPFPKDDHIHEMCRYGGAEIHSVSAFLGGCIAHEAIKIVTKQYKPVNNTFIYDGASTNSATFTF-