Monarch geneset OGS2.0

DPOGS211960
Transcript: DPOGS211960-TA; 2718 bp
Protein: DPOGS211960-PA; 905 aa
Genomic position: DPSCF300011 + 1074347-1082282
RNAseq coverage: 9x (Rank: top 85%)
Annotation
Heliconius: %
Bombyx: BGIBMGA000909-TA; 5e-180; 50.55%
Drosophila: btv-PD; 2e-79; 28.39%
EBI UniRef50: UniRef50_D1ZZU4; 5e-97; 30.44%; Putative uncharacterized protein GLEAN_08109 n=3 Tax=cellular organisms RepID=D1ZZU4_TRICA
NCBI RefSeq: XP_975018.1; 1e-97; 30.44%; PREDICTED: similar to dynein heavy chain isotype 1B [Tribolium castaneum]
NCBI nr blastp: gi|270005976; 2e-96; 30.44%; hypothetical protein TcasGA2_TC008109 [Tribolium castaneum]
NCBI nr blastx: gi|270005976; 2e-103; 30.65%; hypothetical protein TcasGA2_TC008109 [Tribolium castaneum]
Group
KEGG pathway: tca:663896; 3e-97
 K10414 (DYNC2H, DNCH2) maps-> Phagosome
                        maps-> Vasopressin-regulated water reabsorption
InterPro domain: [144-594]; IPR013594; 2.2e-31; Dynein heavy chain, N-terminal domain-1
Orthology group: MCL26310; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211960-TA
ATGAGTGAAATCCGAGCATATATATCAAAAGCGACCGAAAACTTCTTCAACGTGCCGACCCTAAAGCTAAGCGAGAGCAGCGAAGGAGTTCTGATTGATTTCATTCATAACCCATTAGTGTATCTGCTTCAAGCATGTGCATCAGAACAAAGACAAATCATGTTATATGCAGACGTTCGAGTGAGTGCAAACAAATCCATCATATTTTACAAGACCAGTGCCGTGGAACTAACGGGGTCAGACGCTTTGAACGACCTGAACATCATCACACTGACCACGGGTGCGGCGGAGTCTCTCTATCAGATCATAAGACATGTGTACACTCCGATGCTCACCATCGGTGATGATTTATTTTCTATTAAACTTCAAAAAACTTTACTAGAATTAGAATCGAATCTGAAGCTCGTCACTCACGGCGAGGGAGACGAAAATATTAAAGTTATTTTATCCGTTGAGGACGAAGTTGGTTATTGGAAGACCTTTGGAGAGAAGAGGGACATTAAGAAGAGTGAGAGGGAAGCAGCCTCCGCTTTCTGTGTTCTGTTCGAGGATATCTGTGAAGAGATCAGGTGTTTGCCCTCTATCGGTCTGCAGGAAGTGCGAGACTCCGCTGAGAACATCGGAGGTATACTGGACGACGTGTGGCGGTACTCTCCCACGCCGTACTCACAGGACAGGATGGTCCATATTTTTGATATCGTCGGGCACGTGATCTGCTCGGTCACTCAGCAGGCCGTATCCAGAACGGATTTGTGGCGGGTCCACCACGGTCTGAAGGATAATGAGATACTCCATCAACTAACAGAAGCTCTGGCTGCTGTGAAGGTGTGGGTCAGTACGTGTAAGACCCTCACGGACACGTACTGGCCGAACTACTCACTCCATGAATGGAAAGGAAAACCTTATGTGCCCGTATTCTGCCAGAATTTTCAAAAGAGACTTGAAGAAATTCACAGTATTAGGTCCACTTTTAACCAACTCAGTAAACTCTTGGCGAAATCTGAGAGGACTGAATTAAATAGCGACCAGCTATTGGAGCCCTTCAAGAATATAAATGTTTGGATATACAATGGGCACAATCAAATGTGGGAAAATGCAGTTTCAAGATTTTCGTCAAGTATCCGTCCAGCTGAAGCGAAGATAGCCGAGAAACTCAAACCGCGGCTACAGAATTTATCCACCAAGCAGTCCCTGTACGAGTTCTCAAGATACCGCACCTTGCTCAGTCGACCGCTAGTGCAGCAAGCTCTGACCCGAGAGCTGGAACTATTCCTGTCGTCGCTACTGACGATGATGAAGGACGTTAAATCACACTTGGAGGAAGACCTGCCGGGGCTGTACCACCCGCCCGAGATGACAGACCTGGTCGTCAAGGTGCAGTGGGCGAGGCAGATGGAGGACAAGGTGAAAGAAATAGAATCATGCGTCGGAACCGATCTCAGGAATTTGGAGGGAAGTGATGAGGTGCTGCAGCTGGCTGCCAAAGTGCAAAACGATCTGAAGAATACATACACGCAGCTATATGAGGAGTGGTCCAGAGACGTTCAGGCGCAGCTCAGGGCGGGGTCTCTTCAGCTGTCGGAGCGGCCCGTGGTAGAATTCTCTAGTGCTGACCGCCTCATGGTCGTCAACTATCCCGAGGGACTGGAGCGCGTTGAGCGAGAGGCGCGCGCACTGCTTGCAGCCGGGCTGCCGCCACCACCTGGCGCGCTCACAGGGCTCACGGCATCGCTACGATATGCGAGAGCGCTACACCAGGTGGCTTCTTTTCACAACACGCTGGGTGAACGTGCCGTGAGCTCAACACGGCCCATGTTGTTGCATGCAGCACTCCAGCTGGCGGCTCTGGTGGCAGACCACCGTCCCCCGTCTTGGACCGACGAGCGAGCTCTACATGAATACACACAGCAACTTAAGGAAAAAGTGATGGAGCTCGAGAAACAGAATAATTATCTCACCAGCCAACATTTAAAAATCCGAAGTATTGTCGAGAAGCTCAT
GGACACAGAGCTCCTTGCGAAACTCGCTGAATGGAAGAAAGGCATCAAGGATATCAGGGATATTATTGAGAAGGTGGAAGCCAATGGGTACGAGAATACAGAGATGTGGCGCTCCCACTGGGACCTGCAGCTGTACAAAGCCATGGAGTGTCAGTACATGAAGGCACTGCTGTCATTACATTCTCACTTTCCGGCGCTCAGGGTCGACCTGATTCACAGTATTAGGTCCACTTTTAACCAACTCAGTAAACTCTTGGCGAAATCTGAGAGGACTGAATTAAATAGCGACCAGCTATTGGAGCCCTTCAAGAATATAAATGTTTGGATATACAATGGCCACAACCAAATGTGGGAAAATGCAGTTTCAAGATTTTCGTCAAGTATCCGTCCAGCTGAAGCGAAGATAGCCGAGAAACTCAAACCGCGGCTACAGAATTTATCCACCAAGCAGTCCCTGTACGAGTTCTCAAGATACCGCACCTTGCTCAGTCGACCGCTAGTGCAGCAAGCTCTGACCCGAGAGCTGGAACTATTCCTGTCGTCGCTACTGACGATGATGAAGGACGTTAAATCACACTTGGAGGAAGACCTGCCGGGGCTGTACCACCCGCCCGAGATGACAGACCTGGTCGTCAAGGTGCAGTGGGCGAGGCAGATGGAGGACAAGGTGCGGCGAACATGTCTCAACGCTACTGACACTGCACAAACTGTATCATGA

Protein sequence:

>DPOGS211960-PA
MSEIRAYISKATENFFNVPTLKLSESSEGVLIDFIHNPLVYLLQACASEQRQIMLYADVRVSANKSIIFYKTSAVELTGSDALNDLNIITLTTGAAESLYQIIRHVYTPMLTIGDDLFSIKLQKTLLELESNLKLVTHGEGDENIKVILSVEDEVGYWKTFGEKRDIKKSEREAASAFCVLFEDICEEIRCLPSIGLQEVRDSAENIGGILDDVWRYSPTPYSQDRMVHIFDIVGHVICSVTQQAVSRTDLWRVHHGLKDNEILHQLTEALAAVKVWVSTCKTLTDTYWPNYSLHEWKGKPYVPVFCQNFQKRLEEIHSIRSTFNQLSKLLAKSERTELNSDQLLEPFKNINVWIYNGHNQMWENAVSRFSSSIRPAEAKIAEKLKPRLQNLSTKQSLYEFSRYRTLLSRPLVQQALTRELELFLSSLLTMMKDVKSHLEEDLPGLYHPPEMTDLVVKVQWARQMEDKVKEIESCVGTDLRNLEGSDEVLQLAAKVQNDLKNTYTQLYEEWSRDVQAQLRAGSLQLSERPVVEFSSADRLMVVNYPEGLERVEREARALLAAGLPPPPGALTGLTASLRYARALHQVASFHNTLGERAVSSTRPMLLHAALQLAALVADHRPPSWTDERALHEYTQQLKEKVMELEKQNNYLTSQHLKIRSIVEKLMDTELLAKLAEWKKGIKDIRDIIEKVEANGYENTEMWRSHWDLQLYKAMECQYMKALLSLHSHFPALRVDLIHSIRSTFNQLSKLLAKSERTELNSDQLLEPFKNINVWIYNGHNQMWENAVSRFSSSIRPAEAKIAEKLKPRLQNLSTKQSLYEFSRYRTLLSRPLVQQALTRELELFLSSLLTMMKDVKSHLEEDLPGLYHPPEMTDLVVKVQWARQMEDKVRRTCLNATDTAQTVS-