Monarch geneset OGS2.0

DPOGS213722
TranscriptDPOGS213722-TA4332 bp
ProteinDPOGS213722-PA1443 aa
Genomic positionDPSCF300310 + 29146-37668
RNAseq coverage389x (Rank: top 31%)
Annotation
HeliconiusHMEL0044500.074.95% 
BombyxBGIBMGA011628-TA0.073.01% 
DrosophilaCG42669-PK2e-2040.17% 
EBI UniRef50UniRef50_D6WHG24e-5242.31%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WHG2_TRICA
NCBI RefSeqXP_975321.17e-5342.07%PREDICTED: similar to CG33232 CG33232-PC [Tribolium castaneum]
NCBI nr blastpgi|2700042141e-5142.31%hypothetical protein TcasGA2_TC003538 [Tribolium castaneum]
NCBI nr blastxgi|2700042142e-5935.15%hypothetical protein TcasGA2_TC003538 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL25874 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213722-TA
ATGACCATTCAAATATCACGTACTATTACGAAGACAAGAAACCGTCGATCCCTTGAATCATCTCTTCCTAAAGTCGAAACTCCGTTGCAGTCTTCCAAGGTGTCTCCCAAAGTTAATGCTTCCATACAATTTGGCTTTAAATCAAATCTGACTAAACCTAATCCAGTCAAAAGTACTGAAAAACGAAGAAGTTCTGAAAGTAGCGCTAAACCCAGTTGTATACCGTTATCCAAAAAGACCCCTGACAAAGTAAAACGTAATTCAAACTCACCTTCTAAATCGGAATCTCCAAAAGATACATCAAAAAGTGATAGTACAAGAATTAAACTGGAATCCTCTAGGGGAAGAAGTGTACGCAAAAGCCCTGATAAATCGAAGATAGCTCTTCCTCAAGAAAATTTTCGTAGCGATAAGAAACATCGAGCTCAAAAAGATAATGAAAGTCTTAACGAAAAATATCATAATTTTAAGAAAAGACTGACTAGTTTAGATAATTCTTCAAATGACGGTGCTGATACTAAGAATACAACTAGCAAAATTATTGTAAATGAACATTTGGTAACAAATCCTGACGTGGACCATAATTCGACGACAACAAATATGACTACCCTCCATGAACCAATAAGACCGCATGCAAGCTCTCGACTTTCTCAAGAAATGGATAGCTTAGCTGCTTTGACCAAACAAACATTAGATAGAGTAAACAAATTAACAAATAACTTAAGTAATAGTAAATTACCTGTTTTAGATAGTACGGAAGAAAATAAATACAATCATACTTACAACTACGGTCTCATTAACGATGAACCCCAGACAAATAGGGGAGTGGCAGAGAGATTAAGAGATATAGATACGGCAGCTCAGAGATTGATTGATTTCGAAAAACAAAGTGCTGTGTTTTCTGATCTCAACAACGATTGCTCTAGTCAAAACCGTCGTCTTGATAACTCCTCGTTGTCTCAACACACGCCAGTGTCTATTTTGAAAAGAAAATCTATCCATGAGGAATGTAATATCACCAATCCCTCAAGCCAAGCCATTGCTTCTCCGCCTGTTACCTTCTCTCCAAGTGTTGTTGAACCACGAAATTGTCGTTCGGAGAATAGGCAGCGTCAAGGAATACTAAAGAAACGACGTAGCTTAGATGAATCTCAAGTTGCTAGACGTAGGTCCTGTAGTCCAGAAGTATCATTTGCTGATGACGGATCACCTGATACGTGCAAACCAATTTTAAAAAATCGGAGATCATCCTTGGAAGACGTAGTTCGAAATCGTTCTCCAGACGGACAAATACAAGGGATATTAAAAAGGAAAATGAGCAAGGAAGAGGAACATTTAGCTGATGATGTATCACATGGTTCACCTGAGCCTCATGGTATTTTAAAACGGAAATCTAATTCAAGTTCAAGTAGTAGTACAACCTCATCTCACGTGTCCATTGCCCAAGCGGTGTTATTGGCGGCTGCCGGTGGTGCTGAATTGGTTGATGAGGATAAAGATACGGTACGACCCATACTTAAAAAGAAAAGTTTCTCTGAAGAACGCCCCTCCCCGGATATACTTACTTCAGACACCCCAAAACCTATTTTGAAAAAAAAATTGACTGAGCACGATGATCATGACTTCGAACGTCCGAAGAAACCAATTCTAAAGTCGTCAAAAAAGATATCTGGTGATGACGGACATACTTCTAGTTTCGATTTGAGCGAAGACGACAGAAGTTCCCGTAGGCCTTCACTACTTAGGTCTCGGACGTCTGATCACTCGGGTTCAGAATGCGAAACGGCTGTTAAGCCAATTTTAAAGCAGAGATGCTCTAGTCTCACTAGGGAACGCAGCCAATCTCCCCGCCCACGTTTGTCATTCTGTGCAGACAACGATGTGAATATTAGTGCTTCAAATTTTAGTAGCGATGTGAACGACTTGTTAGCGGCTGGACCGCGGCGAATTGTGAATATCGGTACTGATCCTGAAGAAAATTACCCTAGTGCAGTAATACGGCGAAGAAATCAGAGACCTAAAACGAATATTCGTTCTATCAGCTTGGTGTGTGACGTCAATGATGAATTACTTTCTATTTTAAATAATCGTCGCCTGAAAGTGGAAGAGCAGTGTAACAACGGACAAAATAATATATTAGGAAGGGAGAAATTAAATGAAGGCAACGATCCTAAAACTTTTCCGTCGATCGCTTCGAGAATAAAGACGATGGAAGAGGCACTTACTAAGGATAATATACCGCAAGAACAAGCTTCAATGAAACAAAGAAATCGAGACAAGGAGCGTTTCAAAACGCAACCTATCACAATCGATGAAATGAGATCTGTCGCATCCAGTTTGGAGCCGGGCCAAGCGAATTTCCAAGCGTTCGGCATTGCGGGCTGTAGTTTTCCGACCCATTCCGTCGGGACTGGTGCGTCTCTCTCATCCAGGGGTCTCCTTTCAGAGGAGCCCGAAAGAGATCCTTACACAGAATTTGAATCAACTTCTTACGATTCTAGTCTGTCGAACGCTAAGTTACCAGTCACAGAGGACTACGGTGAAGCAACTTTCTTAGATTTGGAGAAGTTTAGTGCTAATGGTGAAATAAAACAGTCTACTCTAGATGAAATAGAGCAAGAAGTAAACAAAGTGCGTGTAGCTCTGGATGAGGATTGTCGCGCCTTGGACGAGGACGATAGTAACCAAGCCGAGGCTGACAATTGGAGTTTAAACGTTTCTTGTGATAGCGGCGTTTACAATCGCGCGTCTTCCCGCGACTCCGGGCCGCATTCTGGCGAGGAGTTGGGCTTAATTGAGAGCCAAGAAATTAGTGAAAACCACGCCACCAACAGCACCAACGACTGGTCTTCGACATCTATGGAAGAAGGTCTATTCAAAATGGAACGCATACGAAAATCAACTGAAAACGAAGCGCTTCAAACTCCTGAAGATAATGGGAGTGATGACGAAAACAATGATCATTCTCATTTCGCACTCGGCTTAGTGAAAAGCAACAGTGTCGTTGCTCGAGCAAGCATGTGGCAACAGTTACAACAGCAAGCTAAAGGTACGCCAAAGCCGCTTCTTCGTCACAGTCGCTCCAAAGTGAAGGAGGGTCCTTCGATGACGGAGAGCTTTAAAGCCCAGGAAATTAACACCGTTCCGCAAGAAACCCCACTTGCCCCATCTAAAAGCACAGCAAATGTCTTAGATAGAGATGAAGATGCTAAGTTAGATGAGGATGATCCGGCGAAGATGTCGCTGGCTGACAAAATGAAAATGTTCAACACCAAATTGACACACAAGCCGCCGGTAGCCGGTCTGAGGCCGAAGGAAGATAGAGTGCCGAGAGCGTCTAGACTCAGAACCATGCCGGTACTAGCGAGTCAGGTCCAAGAAGCTATGGAACAAAACGAGAGACTAACAAAGTCTCTAACCCACGAGGATGTCCCGAGGAATAATGACTTCCAACTGAAGATGGAACTTTTCCGATCGGCGAGTGCAAAAAACACGTCATTAGAGTATCTCATGAGGCAGAACTCGAAGTTTAGAAGCTTAGACCTCGATGACGACTCGCCATTGGAGCGAGCTCAAAGAATGATAACACCGGAAGTGAGAGGTATACTGAAGTCGGGATCTACGGTTGTACCTTCGAAGCCGAAAATTTTGGCGAAAGGAGAAAGTTCTGAGGGTCTCAAGGACGAGGGAATAGACAGTTCCTCGGACGAAGAATCCGCATCAGCATCCAGTGTTTCATCTTCAGAGAAAAGCTGCTCATCATCAGGTAGTTCAGATGAGATGCCAGGACCGAAGAGGAGGTTTCAGCGAAAGAACAACAAGCTGAAGTCGTCGCGAACTGACTCAGATCTCACGAAACTCCAAGATCCATATCCTCGTAAGATCCAACTGCCGTGCGCCAGTGAATTAAAAGAACGTATAGCGCAAGCGAAAAACCCTGATGTCAAAATACCATTGCTGGGGAAACTCGGCAAGAAACCATCAGAAGAAGCCAGCGGAGAGGATGACAAGCTGAGATACAAAAGATTCGTCAAGAAACTCGACGAGCCTCTGCAACTGGGGAAGTTGAGGCGGCCGATGGAGAAAACGTCGTCGCTGGAAGAAAAACCGCCCGTGCTGAAGATAACGGCATTAAATCAAGCTGCCAAAAACAAATTCTTTGGTGTCGAAGACAAGAAAGAGAAGAGCACCGATGAATTGACGGCTGTGGTCAGAAAATATATTCCGGTAAATTCAGTGTCACGTCACAGTATAGAGGGCGGTTCGGGTAGTGACGGGGAGAGCAGCGGCGGGCGGGAGGTGCGACACATCAACACTAGAAGGCAGAGGTAA

Protein sequence:

>DPOGS213722-PA
MTIQISRTITKTRNRRSLESSLPKVETPLQSSKVSPKVNASIQFGFKSNLTKPNPVKSTEKRRSSESSAKPSCIPLSKKTPDKVKRNSNSPSKSESPKDTSKSDSTRIKLESSRGRSVRKSPDKSKIALPQENFRSDKKHRAQKDNESLNEKYHNFKKRLTSLDNSSNDGADTKNTTSKIIVNEHLVTNPDVDHNSTTTNMTTLHEPIRPHASSRLSQEMDSLAALTKQTLDRVNKLTNNLSNSKLPVLDSTEENKYNHTYNYGLINDEPQTNRGVAERLRDIDTAAQRLIDFEKQSAVFSDLNNDCSSQNRRLDNSSLSQHTPVSILKRKSIHEECNITNPSSQAIASPPVTFSPSVVEPRNCRSENRQRQGILKKRRSLDESQVARRRSCSPEVSFADDGSPDTCKPILKNRRSSLEDVVRNRSPDGQIQGILKRKMSKEEEHLADDVSHGSPEPHGILKRKSNSSSSSSTTSSHVSIAQAVLLAAAGGAELVDEDKDTVRPILKKKSFSEERPSPDILTSDTPKPILKKKLTEHDDHDFERPKKPILKSSKKISGDDGHTSSFDLSEDDRSSRRPSLLRSRTSDHSGSECETAVKPILKQRCSSLTRERSQSPRPRLSFCADNDVNISASNFSSDVNDLLAAGPRRIVNIGTDPEENYPSAVIRRRNQRPKTNIRSISLVCDVNDELLSILNNRRLKVEEQCNNGQNNILGREKLNEGNDPKTFPSIASRIKTMEEALTKDNIPQEQASMKQRNRDKERFKTQPITIDEMRSVASSLEPGQANFQAFGIAGCSFPTHSVGTGASLSSRGLLSEEPERDPYTEFESTSYDSSLSNAKLPVTEDYGEATFLDLEKFSANGEIKQSTLDEIEQEVNKVRVALDEDCRALDEDDSNQAEADNWSLNVSCDSGVYNRASSRDSGPHSGEELGLIESQEISENHATNSTNDWSSTSMEEGLFKMERIRKSTENEALQTPEDNGSDDENNDHSHFALGLVKSNSVVARASMWQQLQQQAKGTPKPLLRHSRSKVKEGPSMTESFKAQEINTVPQETPLAPSKSTANVLDRDEDAKLDEDDPAKMSLADKMKMFNTKLTHKPPVAGLRPKEDRVPRASRLRTMPVLASQVQEAMEQNERLTKSLTHEDVPRNNDFQLKMELFRSASAKNTSLEYLMRQNSKFRSLDLDDDSPLERAQRMITPEVRGILKSGSTVVPSKPKILAKGESSEGLKDEGIDSSSDEESASASSVSSSEKSCSSSGSSDEMPGPKRRFQRKNNKLKSSRTDSDLTKLQDPYPRKIQLPCASELKERIAQAKNPDVKIPLLGKLGKKPSEEASGEDDKLRYKRFVKKLDEPLQLGKLRRPMEKTSSLEEKPPVLKITALNQAAKNKFFGVEDKKEKSTDELTAVVRKYIPVNSVSRHSIEGGSGSDGESSGGREVRHINTRRQR-