Monarch geneset OGS2.0

DPOGS206915
TranscriptDPOGS206915-TA1899 bp
ProteinDPOGS206915-PA632 aa
Genomic positionDPSCF300001 - 1442792-1445389
RNAseq coverage91x (Rank: top 63%)
Annotation
HeliconiusHMEL0143470.068.20% 
BombyxBGIBMGA014521-TA0.063.41% 
DrosophilaSas-4-PA2e-5956.11% 
EBI UniRef50UniRef50_UPI00022C9EFD2e-5955.80%UPI00022C9EFD related cluster n=1 Tax=unknown RepID=UPI00022C9EFD
NCBI RefSeqXP_002137953.19e-6056.67%GA27499 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|3504146959e-5955.80%PREDICTED: hypothetical protein LOC100749532 [Bombus impatiens]
NCBI nr blastxgi|3479667683e-9031.70%AGAP001897-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
InterPro domain[452-628] IPR0098529.3e-70T-complex 10/CenJ, C-terminal
Orthology groupMCL17894 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206915-TA
ATGGTTGAGATGAATGCAAAGTTAGTTGCTACAAGTGAATTATTGAAAGATAGACTTCGTGAGCTGGAAGATGAAATAGAAACATTCAGGAAAGAGAATGCTAACCTTAACAAGATGAGAGAAGATATAGAACTTGAGCGTCAAAAGTTTTATGAGGAGAAAAGTCTGATTGAACAGAAGTTTAATGAGGAGAAAATTTTATCTGAATATTATTTAGCAGAAGAGAAAGAAAAATTAAGTAAACAAAAACAAATGTATGAAAGATATGTAAAAGATATGAGGGGTAGATTAAATAAAAAAGAAAAAGACGAGGTTATTAATTTGAAGAAAGAAATTGTTGACTTGAAGGAAGAAATAAGATCGAAAGACGCAAAATCAACATCTACAATTGCTAGGCTTAGAAATCAATTAAAGATTATGGAAAAAGAGAAAAAAAATTTTGAAGACGAGATTGAAAAATTAAAAAAAGAGAATCGACGAATTCAGCACAGTAATGATATATCTAGAAGGCTAACAAATATCAAGTATTTGGAAGAAATAAATAAAAAACTAAATAATATGACCTCGAGAGATGCACATTCTGATATTGTTTATGATCCTGACATGAAATATAAAGCGTATGAAATTGAACGACAAAGCAGAACCAGAAAAAACCTTCCCGCAAATAAAGGTACCATTAGACCACGAGCAAAAAGTGTTCCGAACCTCAAAGTAACGTCAAGGTATGCTAAATATTTTAGCCAGAAAGATTCATTGAGTGAAAAAGAGAAAAATAGAACATTTCATAATGAACATGTCCAAATATATTCAAACTCTTCAAATAACGATATATCAATTAGTGACGAGGAAAACAATTACGACGATTATGAAAATAAAACTTCCGATAGTGAAATCAGTAACAATTTAGAGAAAATATACAATACAAGATTTAAATCAGTATCACCATTATCTAAAAGCCATGAAAACTTTGACCAGCCTGATAATAATGATTTTTTCCTTAGAAAAGGAACTTCAGCTCGGAGTTTATCTAAAAATTCGATTTCCCCTAATCGAACGAACTATGATTCGTCAAAGTCTGCTCCAAATGTTAACCACAGAAATACATTGAGTAGTAGAAATTCAAATAGTAGCAAATCACCAAGTAGTATTTACCAGGAAAATTACGAGCAACAAGAATGTCGAAGATCCAAGTCACCAGTGTCAATATTAAGTAATAAGTCTTCATCTAGCCAAAAATGTGGCACAGTTATAAACAAAGAAACATATCTAAATTTGAATATGATACCTAGTCCAGAACCGACAGTTAGTAAAAGCAGTTTAACAAAAACTAATTTAAATCCAACGGAAGTTAGAAAGCCAGATGGTACTAAAGAAATAAGATTCCCTAATGGAAACATTAAAATAATATCAGCAGATGGAAAATATAGTAAATTTGTGTATTACAATGGAGATGTGAAAGAAAATTTCTATAATGAGGGTAGGATAAAGTATTATTATGCTGAAACTAAGACATTCCATACTACGCATCCTGATGGTTTAGAAGTTTTAGAGTTCCCTGACGGTCAAGTTGAGAAGCGTTACAGAGACGGTTCTACAGAAATCAGACTTCCAAATGGCAGTATACGTTACTTCGATCCTAAGAACGAGCATGTGCGAGAGGAGTGGCGTTTCCCAGACGGCGCCGCTCTTACGGTCTCCGCTAATGGGGAAAAGAGAATTGTTTTTGCTAATGGACAGATAGAAGTCCATGCAAAAGAACACAAGAGACGAGAATTTCCTGATGGCACAGTTAAATTGGTTTATAATGATGGTACATCTGAGACAAGATATTCTTCAGGCCGAGTCAGGATTAAAGATAAACATGGAAATCTCCTTATGGACTCTGCTACCAGACAATGA

Protein sequence:

>DPOGS206915-PA
MVEMNAKLVATSELLKDRLRELEDEIETFRKENANLNKMREDIELERQKFYEEKSLIEQKFNEEKILSEYYLAEEKEKLSKQKQMYERYVKDMRGRLNKKEKDEVINLKKEIVDLKEEIRSKDAKSTSTIARLRNQLKIMEKEKKNFEDEIEKLKKENRRIQHSNDISRRLTNIKYLEEINKKLNNMTSRDAHSDIVYDPDMKYKAYEIERQSRTRKNLPANKGTIRPRAKSVPNLKVTSRYAKYFSQKDSLSEKEKNRTFHNEHVQIYSNSSNNDISISDEENNYDDYENKTSDSEISNNLEKIYNTRFKSVSPLSKSHENFDQPDNNDFFLRKGTSARSLSKNSISPNRTNYDSSKSAPNVNHRNTLSSRNSNSSKSPSSIYQENYEQQECRRSKSPVSILSNKSSSSQKCGTVINKETYLNLNMIPSPEPTVSKSSLTKTNLNPTEVRKPDGTKEIRFPNGNIKIISADGKYSKFVYYNGDVKENFYNEGRIKYYYAETKTFHTTHPDGLEVLEFPDGQVEKRYRDGSTEIRLPNGSIRYFDPKNEHVREEWRFPDGAALTVSANGEKRIVFANGQIEVHAKEHKRREFPDGTVKLVYNDGTSETRYSSGRVRIKDKHGNLLMDSATRQ-